Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2149 |
Symbol | |
ID | 5734022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2706061 |
End bp | 2708229 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279290 |
Product | 5'-nucleotidase domain-containing protein |
Protein accession | YP_001544917 |
Protein GI | 159898670 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.70819 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTTG CAATGTGGCA TAACGTGAAA AGCAAAGTGG GCATCCTACT TGGCTTCACG CTGTTGGTCG GCTCGCTTGG CCAAGCTGCA CCAACGAAGG CCGCCGAAAA GCTGTGTTTC AACCAGCCCG GCGTTGTTGA ATGTATTGCT CCCGAGTTCC GCGATTACTG GGAAAAGAAC GGTGGGCTGC CCGTTTTTGG CTATCCCCAA ACCGCAGCCT ATGAAGAAGC CACTCCTGAA GGTAAGTTCT TGGTGCAATA TTTCGAGCGC CAACGGCTTG AATATCACCC CGAGAAACCA GCTCCATTTA CGATTTTGCT TGGCCGGATT AATGATGAAG TGCTGTTGCG CGAAAACCGT GTATGGCGCG ATTTCCCCAC TGCTCCCCAA GCGACTGGCT GCCAATTGTT CAGCGAAACT GGCCATAGCG TTTGTGGTGA GTTCTTGAAA TATTGGAACT CGCAAGGTTT GGATTTGGGC GAAAATGGCA TTACCTACGG CGAATCATTA GCCTTGTGGG GCTTGCCACT CTCTGATCCG CAAGAAGAAA TTAACATCGA TGGAGATAAA GTGTTGACCC AACACTTTGA GCGTGCTCGC ATGGAATGGC ACACCAAAGA TGGCAAGAAC CAAATCTTGC TGACCCGCCT TGGCGTGACC TTGGTGCCAA TGCAGCTCAA AATGTTGGCA ATCAACGACT TCCACGGCCA AATTTCAACG GGCCGCAAGG TGAGCAATAA AGATGTTGGT GGCGCTGCCT ACTTGAGCAG CTACATCAAA CAAGCTCGCG CCAAAGCTCG CTACTCGTTG ACCGTGCAAG CTGGCGATAT GGTCGGCGCA AGCCCACCAA GCTCAGCTTT GTTGCAAGAT CAGCCAACCA TGGAATTCCT CAATATGTTG GGAGTTAATG TTGGCACAAT CGGCAACCAC GAATTCGATG AAGGCTTCGA TGAAATGATG CGCTTGATCG ATGGTGGCTG TCACCCAACC GCTGGCTGCT GGGAAGGTGC AAACTATCCC TATGTTGTGG CCAACGTGAT CGACAAACGC ACCAATAAGA CAATTTTGCC AGCCTATCAT GTGATGAACA TCGATGGGGC ACGCATTGGC TTTATTGGCG TAGTATTAGA AAATACCCCT GAAATCGTGA TTCCATCAGG TGTGACCAAC CTTGAGTTTA TCGATGAAGT TACGGCGATC AACCAAGCAG TAACCGAGTT GAACGGCCAA GGTGTGCATG CAATCATTGT TTTGGCCCAC GAAGGTGGTA CGCAAAACGC CACAACTGGC GCAATCACTG GCCCAATTGC TGAAATTGCC AATGGCATTA ATGATGATGT TGATGTGATC GTTAGCGCTC ACACCCACAC CTCAATCAGC GGCGAAGTTG ATGGCAAGTT GATCACCCAA GCGCTTTCGT ATAGCACCGC ATTTGCTGAT ATCGATTTGA CAATCGACCG CGCCAAACGC GATATTGTCG CCAAAAAAGC GACGATCGTC ACGACCTTCC ACGAAGACAT GACCCCTGAT GCTGATGTTG CGGCAATGGT CAAGAAATAT GAAGACCAAG TAGCACCCCA AGTCAACCGC AAGGTTGGTA CTGCTGCTAG TGCGATCACC AACACGGCCA ATGCGGCTGG CGAATCGGCC TTAGGTAACC TGATTGCCGA TGCTCAACGT AACACCATGA GCACTCAATT TGCCTTTATG AACCCAGGTG GCATTCGTGC ACCACTCGAT GCTGGCGAAA TTACCTGGGG CGAGTTGTAT TCAATTCAGC CATTCAGCAA CGATTTGGTC AAGATGACCG TAACTGGGGC TGATATTTAC ACCTTGCTCA ACCAACAATG GCAAAACCAA AGCGATGGGA CAGTTCGCGC TCGTATCCTG CAAATTTCAG GTTTGAGCTA CACCTGGACT GATGCCAATC CTGTTGGTCA AAAGGTTGTC GAGGTGCTCG ATGGTAACGG CAAGGCTTTG GATAAAGCTG CGAGCTACAC GATCACCGTC AATAGCTTCT TGGCTGATGG TGGCGATGGC TTCGTTGTGC TCAAACAAGG CACCAATCGC GAAGTTGGCC CAACCGATCT CGATGGCTTC GTGCGCTACA TCGAAAAGTT AGCTCAGCCA ATCAGCGCCA ACATCGAAAA CCGTATCGTC AAACAATAA
|
Protein sequence | MDFAMWHNVK SKVGILLGFT LLVGSLGQAA PTKAAEKLCF NQPGVVECIA PEFRDYWEKN GGLPVFGYPQ TAAYEEATPE GKFLVQYFER QRLEYHPEKP APFTILLGRI NDEVLLRENR VWRDFPTAPQ ATGCQLFSET GHSVCGEFLK YWNSQGLDLG ENGITYGESL ALWGLPLSDP QEEINIDGDK VLTQHFERAR MEWHTKDGKN QILLTRLGVT LVPMQLKMLA INDFHGQIST GRKVSNKDVG GAAYLSSYIK QARAKARYSL TVQAGDMVGA SPPSSALLQD QPTMEFLNML GVNVGTIGNH EFDEGFDEMM RLIDGGCHPT AGCWEGANYP YVVANVIDKR TNKTILPAYH VMNIDGARIG FIGVVLENTP EIVIPSGVTN LEFIDEVTAI NQAVTELNGQ GVHAIIVLAH EGGTQNATTG AITGPIAEIA NGINDDVDVI VSAHTHTSIS GEVDGKLITQ ALSYSTAFAD IDLTIDRAKR DIVAKKATIV TTFHEDMTPD ADVAAMVKKY EDQVAPQVNR KVGTAASAIT NTANAAGESA LGNLIADAQR NTMSTQFAFM NPGGIRAPLD AGEITWGELY SIQPFSNDLV KMTVTGADIY TLLNQQWQNQ SDGTVRARIL QISGLSYTWT DANPVGQKVV EVLDGNGKAL DKAASYTITV NSFLADGGDG FVVLKQGTNR EVGPTDLDGF VRYIEKLAQP ISANIENRIV KQ
|
| |