Gene Haur_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2149 
Symbol 
ID5734022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2706061 
End bp2708229 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content50% 
IMG OID641279290 
Product5'-nucleotidase domain-containing protein 
Protein accessionYP_001544917 
Protein GI159898670 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.70819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTG CAATGTGGCA TAACGTGAAA AGCAAAGTGG GCATCCTACT TGGCTTCACG 
CTGTTGGTCG GCTCGCTTGG CCAAGCTGCA CCAACGAAGG CCGCCGAAAA GCTGTGTTTC
AACCAGCCCG GCGTTGTTGA ATGTATTGCT CCCGAGTTCC GCGATTACTG GGAAAAGAAC
GGTGGGCTGC CCGTTTTTGG CTATCCCCAA ACCGCAGCCT ATGAAGAAGC CACTCCTGAA
GGTAAGTTCT TGGTGCAATA TTTCGAGCGC CAACGGCTTG AATATCACCC CGAGAAACCA
GCTCCATTTA CGATTTTGCT TGGCCGGATT AATGATGAAG TGCTGTTGCG CGAAAACCGT
GTATGGCGCG ATTTCCCCAC TGCTCCCCAA GCGACTGGCT GCCAATTGTT CAGCGAAACT
GGCCATAGCG TTTGTGGTGA GTTCTTGAAA TATTGGAACT CGCAAGGTTT GGATTTGGGC
GAAAATGGCA TTACCTACGG CGAATCATTA GCCTTGTGGG GCTTGCCACT CTCTGATCCG
CAAGAAGAAA TTAACATCGA TGGAGATAAA GTGTTGACCC AACACTTTGA GCGTGCTCGC
ATGGAATGGC ACACCAAAGA TGGCAAGAAC CAAATCTTGC TGACCCGCCT TGGCGTGACC
TTGGTGCCAA TGCAGCTCAA AATGTTGGCA ATCAACGACT TCCACGGCCA AATTTCAACG
GGCCGCAAGG TGAGCAATAA AGATGTTGGT GGCGCTGCCT ACTTGAGCAG CTACATCAAA
CAAGCTCGCG CCAAAGCTCG CTACTCGTTG ACCGTGCAAG CTGGCGATAT GGTCGGCGCA
AGCCCACCAA GCTCAGCTTT GTTGCAAGAT CAGCCAACCA TGGAATTCCT CAATATGTTG
GGAGTTAATG TTGGCACAAT CGGCAACCAC GAATTCGATG AAGGCTTCGA TGAAATGATG
CGCTTGATCG ATGGTGGCTG TCACCCAACC GCTGGCTGCT GGGAAGGTGC AAACTATCCC
TATGTTGTGG CCAACGTGAT CGACAAACGC ACCAATAAGA CAATTTTGCC AGCCTATCAT
GTGATGAACA TCGATGGGGC ACGCATTGGC TTTATTGGCG TAGTATTAGA AAATACCCCT
GAAATCGTGA TTCCATCAGG TGTGACCAAC CTTGAGTTTA TCGATGAAGT TACGGCGATC
AACCAAGCAG TAACCGAGTT GAACGGCCAA GGTGTGCATG CAATCATTGT TTTGGCCCAC
GAAGGTGGTA CGCAAAACGC CACAACTGGC GCAATCACTG GCCCAATTGC TGAAATTGCC
AATGGCATTA ATGATGATGT TGATGTGATC GTTAGCGCTC ACACCCACAC CTCAATCAGC
GGCGAAGTTG ATGGCAAGTT GATCACCCAA GCGCTTTCGT ATAGCACCGC ATTTGCTGAT
ATCGATTTGA CAATCGACCG CGCCAAACGC GATATTGTCG CCAAAAAAGC GACGATCGTC
ACGACCTTCC ACGAAGACAT GACCCCTGAT GCTGATGTTG CGGCAATGGT CAAGAAATAT
GAAGACCAAG TAGCACCCCA AGTCAACCGC AAGGTTGGTA CTGCTGCTAG TGCGATCACC
AACACGGCCA ATGCGGCTGG CGAATCGGCC TTAGGTAACC TGATTGCCGA TGCTCAACGT
AACACCATGA GCACTCAATT TGCCTTTATG AACCCAGGTG GCATTCGTGC ACCACTCGAT
GCTGGCGAAA TTACCTGGGG CGAGTTGTAT TCAATTCAGC CATTCAGCAA CGATTTGGTC
AAGATGACCG TAACTGGGGC TGATATTTAC ACCTTGCTCA ACCAACAATG GCAAAACCAA
AGCGATGGGA CAGTTCGCGC TCGTATCCTG CAAATTTCAG GTTTGAGCTA CACCTGGACT
GATGCCAATC CTGTTGGTCA AAAGGTTGTC GAGGTGCTCG ATGGTAACGG CAAGGCTTTG
GATAAAGCTG CGAGCTACAC GATCACCGTC AATAGCTTCT TGGCTGATGG TGGCGATGGC
TTCGTTGTGC TCAAACAAGG CACCAATCGC GAAGTTGGCC CAACCGATCT CGATGGCTTC
GTGCGCTACA TCGAAAAGTT AGCTCAGCCA ATCAGCGCCA ACATCGAAAA CCGTATCGTC
AAACAATAA
 
Protein sequence
MDFAMWHNVK SKVGILLGFT LLVGSLGQAA PTKAAEKLCF NQPGVVECIA PEFRDYWEKN 
GGLPVFGYPQ TAAYEEATPE GKFLVQYFER QRLEYHPEKP APFTILLGRI NDEVLLRENR
VWRDFPTAPQ ATGCQLFSET GHSVCGEFLK YWNSQGLDLG ENGITYGESL ALWGLPLSDP
QEEINIDGDK VLTQHFERAR MEWHTKDGKN QILLTRLGVT LVPMQLKMLA INDFHGQIST
GRKVSNKDVG GAAYLSSYIK QARAKARYSL TVQAGDMVGA SPPSSALLQD QPTMEFLNML
GVNVGTIGNH EFDEGFDEMM RLIDGGCHPT AGCWEGANYP YVVANVIDKR TNKTILPAYH
VMNIDGARIG FIGVVLENTP EIVIPSGVTN LEFIDEVTAI NQAVTELNGQ GVHAIIVLAH
EGGTQNATTG AITGPIAEIA NGINDDVDVI VSAHTHTSIS GEVDGKLITQ ALSYSTAFAD
IDLTIDRAKR DIVAKKATIV TTFHEDMTPD ADVAAMVKKY EDQVAPQVNR KVGTAASAIT
NTANAAGESA LGNLIADAQR NTMSTQFAFM NPGGIRAPLD AGEITWGELY SIQPFSNDLV
KMTVTGADIY TLLNQQWQNQ SDGTVRARIL QISGLSYTWT DANPVGQKVV EVLDGNGKAL
DKAASYTITV NSFLADGGDG FVVLKQGTNR EVGPTDLDGF VRYIEKLAQP ISANIENRIV
KQ