Gene Information |
Locus tag | Haur_2980 |
Symbol | |
ID | 5734852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3760289 |
End bp | 3762613 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280124 |
Product | hypothetical protein |
Protein accession | YP_001545746 |
Protein GI | 159899499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
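The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the listed gene length of 2325 bp fits this reading) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 3760289
END_BP = 3762613


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2325
print(protein_length(length_bp))  # 774
```

Because the gene is reported as not spliced and not a pseudogene, the simple "codons minus one" rule is enough here; a spliced or frameshifted feature would need a different calculation.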
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGCC GACGTGCTGG AATGTTAATT ATTGCCATTC TCAGTTTGGC GCTGCTGGTT GGCTTGATTG GCTATGCCTT GTTGGTGCAG CCAGTTGTCG CCCAACTTTC ACCAACGCCA CCAACCCTCT TTACCCAGTT GCAATACCCT TTCAATCAGA GTTTGATTCC AGTGGGCAAA GTGCTGACGG TGCATTCGCG CTCGTGGGGC AGCCAGCCGA TTGAGAACGT CGAATTATGG GCTGATGGTC AGCCATGGGC GCTGCAAGCA GCCAATAATG CAACGCTATA CGAAGGCTTA TTTGCTTGGC AATCTCTTGG CTTGGGCGAC CACAGTTTGG TTGCTCGCAC CAACGATACC AGCAAAATTC CCTCAACTTC GGCGATTGTG CGCTTGAATG CTGTGATGCC ATATGAGCCT GTTGCGACCT TCCAAGTGCA ACAGTCAGCA GGTGAAAGCC TGGAAAGCCT GAGCAAAGCC TTTGACGTTG CGCCCCAAAC TCTGCTGAAG CTCAACCCAC AGCTTGGCAA TTTGCCCTTG AACCAGCCCT TGGGCAGCGA TCAATCGATT ACGATTCAGT TGGCTGGTCA ATTTACCCCC ATCAGCACTA CCACCAGCCT GAGCGCGACT CAAACGCCAC TTCCGGCTGA TGTTGCCAAT TTGCCGTTGG CTACACCCTA CGACCCCAAT GCAATCTGGT TTGATTTGCA GCGCCGTTTT GGCACAGCCA ATGTGCCGTT AGCTCCTGAG GCGCTGGTTG GCAGCAGCAA TTGCCAAACC CAGTTGGTGT TTACGCCCAC ATCAGATGAT GCTGATGGCT TTTTTATCTA TCGGGCTGGG CCAACGCAAA ACCATTTTGA GTTGGTGGCA ACCGTTGCCG CCAATGGTTC TGGCCCACAG CTTTGGCAAG AACCAGCCGA TTTTGGCCAA ACGCTCTATT ATGTGGTGGC GTTTAATCCA GCTGGGGCAG CGGCTAGCCC AGTGGTGGCA CTAGAGAATG CTGATTCGGC TTGTGCCACG TTGCAAGTGC CTCAATTTGC CAGTTTGCTG CTCACGCCAA AAATAGCCGT CAGCGATGTT TATTGTTATA GCTCGTTAGC AAACGGCCCA TGGCAACGAG TGCCCAATCA AGGCTTTTTG CTGCCAATTG CTGGCGGCTA CGATTTAGCC AGCGCTTTAC CACTAACCGC GATCAGCGCC AGCCAAACTT GGAATTTGGA ATGCTGGGGC TGGAATGGCA CGCAGCCACA ATTATTGGGC AACACCAGCA CCACATTAAG TCTTGGGCAG GAGCAAATAA TTGCCCTCGA TGCCGATTTA TTCAGTTTGC AAGGCCAATT GAACACTAGC ACCCAAATTC AGCAAGTGCC TACGCTCAGC ACAATTGCGC CGCCAACCAA CCTTCGCTTA ACCACCGATC TTGAGGAGTG TGTGCAAGCA GCGCCGCAAG CTGATGATTT TTGGCGGACG GCTTGTAGCA CCAATCTGGC GGCGGGAGCG ACCGTTCTAA CCTGGCAATG GAGCGAACAG GCCTGTTTTC CAAGCGCTGA CGGCCAAGAT TGTAGTGCCA ACGCCAATCT TGAAGGCTTT CAAATTAACG ATCGCTTGGC TGGAACGCCG CTCGAATTAA CCCGCGTTAA TCCTGAACAA CGCTTGGTGT TTTTGGCTCC ACGTACCATG CCCAGCCCAA CCGATGAATG TTTGAGTGTG CAAGCTTTTC GTGGTTTGGC CGTTTCGCTG GATAGTGAAG TGCTCTGTTT GCCAGCGCTT AAATTGGCAG CAGGTAGCTA TACCCTCGCC 
CCAAGTTTGT TTAATCTGAA TGCAGGCGTG GCTCAGCAAA CGGTTGGTGA TGGCTGCCCA GCCTTGCCGA TCAACCAAAG CCAATCAACC AGCCTGAATT ATCCCTATGT CAGTAGTTTG TTGCTACTTC AGCGCAATGC TGTGGCCGAA ACTGCCTGTC GGCGCTACAC GGCCAGCTGG TTTGATGGCA GTGTGAGTTT TATTTTGCCG CGACTCGATC AATCAGTTGG TGGCTTAGAA TTAAGTTTTA GTGTCGCAAG TCAACCTGAG CCAGCCAACG AAGCCTGCCC CGCCAGCGCC AGCCTGCAAA TCGGGAGCGC TAGCCAAACA ATTGATTTAG CAACTTGGCC CACGAATGGC TTGGTTACAG TTGCTGTTGA TCAAGCCTTG CTCGAACAAA TTGGGCAATC GAATGACCCC AAATTGAACT TTACACTTCA AGCCGAGCGC CAACTTGCCA ATCGCAACCA ACGTTGCAGT AGCGAGCTAG GCGATTTTTC ATTGAAACTA ACGGTGCAAC CATGA
|
Protein sequence | MMSRRAGMLI IAILSLALLV GLIGYALLVQ PVVAQLSPTP PTLFTQLQYP FNQSLIPVGK VLTVHSRSWG SQPIENVELW ADGQPWALQA ANNATLYEGL FAWQSLGLGD HSLVARTNDT SKIPSTSAIV RLNAVMPYEP VATFQVQQSA GESLESLSKA FDVAPQTLLK LNPQLGNLPL NQPLGSDQSI TIQLAGQFTP ISTTTSLSAT QTPLPADVAN LPLATPYDPN AIWFDLQRRF GTANVPLAPE ALVGSSNCQT QLVFTPTSDD ADGFFIYRAG PTQNHFELVA TVAANGSGPQ LWQEPADFGQ TLYYVVAFNP AGAAASPVVA LENADSACAT LQVPQFASLL LTPKIAVSDV YCYSSLANGP WQRVPNQGFL LPIAGGYDLA SALPLTAISA SQTWNLECWG WNGTQPQLLG NTSTTLSLGQ EQIIALDADL FSLQGQLNTS TQIQQVPTLS TIAPPTNLRL TTDLEECVQA APQADDFWRT ACSTNLAAGA TVLTWQWSEQ ACFPSADGQD CSANANLEGF QINDRLAGTP LELTRVNPEQ RLVFLAPRTM PSPTDECLSV QAFRGLAVSL DSEVLCLPAL KLAAGSYTLA PSLFNLNAGV AQQTVGDGCP ALPINQSQST SLNYPYVSSL LLLQRNAVAE TACRRYTASW FDGSVSFILP RLDQSVGGLE LSFSVASQPE PANEACPASA SLQIGSASQT IDLATWPTNG LVTVAVDQAL LEQIGQSNDP KLNFTLQAER QLANRNQRCS SELGDFSLKL TVQP
|
| |
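The protein sequence can be derived from the gene sequence using translation table 11. The following is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is already given in the coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied even though the gene lies on the minus strand), and it strips the spaces that separate the 10-base blocks. Table 11 assigns the same amino acids as the standard code and differs in which codons may initiate translation; since this gene starts with ATG, the sketch does not treat alternative start codons specially. The names `clean` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first, second and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(cds)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGATGAGCC GACGTGCTGG AATGTTAATT"))  # MMSRRAGMLI
print(translate("ACGGTGCAAC CATGA"))                  # TVQP
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the complete gene sequence to `translate` should give the full protein.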