Gene Haur_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2119 
Symbol 
ID5734007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2661056 
End bp2662570 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content54% 
IMG OID641279260 
ProductABC transporter related 
Protein accessionYP_001544887 
Protein GI159898640 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.307687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATT CGCAGAGTCT TATCACCGTC AGAGGTATTT CCAAAGCGTT TCCTGGTGTC 
GTCGCCCTTG AACGTGTTGA TTTTACGCTG AGGAAAGGCG AAATCCACGC GCTCATGGGC
GAAAATGGTG CCGGAAAATC TACCCTGATC AAAATTCTCA CTGGGGTGGA GCAGCCCGAC
ACGGGCAGTA TTGAATTAGA GCACACGCTT ATCCGCATTC GGTCGCCGCA GCATGCCCAA
CAACTTGGCA TCAGCACGGT GTATCAAGAG ATCAATCTGT GCACCAATAT TTCGGTCGCC
GAAAATATTA TGCTCGGCCA TGAGCCACGG CGCTTTGGTG GCATCCACTG GAAAAAGCTG
AACGAAATGG CCCGCCAAGC GCTCCTGCGC TTAGGCATCG AGCTGGATGT GACCCAACCA
CTGGGAATGT ACTCGATCGC GATCCAGCAA ATGGTCGCGA TTACCCGGGC GCTTGAGATC
GCGTCGGCCA AAGTATTGAT TCTGGATGAA CCGACATCCA GCTTGGATGC CAACGAAACT
GCACGGCTTT TTGCTGTGAT GCGGGCGCTC AAGCAAGCGG GCATCGGGAT TGTCTTTGTC
ACGCATTTCC TCGACCAAGT GTACGAAATC GTTGATCGAG TGACGATTCT GCGCAACGGC
AGTTTCGTTG GCAGCTACGA CATTGCCGAT CTCCCACGGG TTGAACTGGT AGCAAAAATG
CTTGGTCGCG TAGTTGGCGA ATTGCAAGCA TTGGCCCAAG ATAAAGCGCG AGGTGAGCGT
CAGCCCGATG AACGACGGCT GATCGAAGCG GCTGGTCTGG GCCTGAGCGG CATGCTGGAG
CCGCTTGATC TGGCTATTAA CGTCGGCGAA GTGCTGGGGA TTGCAGGGTT GCTCGGCTCC
GGACGCAGCG AACTGGCAAG TTTGCTGTTT GGTCTGACTT CGCCTGATAG TGGCACGTTG
CTGATCGATG GGCAGTCAGT TACGCGGTTT TCCCCACAAG AATCGATCAA GCGTGGGGTC
GCTTTGTGTC CCGAGGATCG CAAGGCTGAG GGTATTGTAG GCGATCTTAG CATTCGCGAA
AATATTATTT TGGGCTTGCA AGGGCGCTAT GGCTGGTTCA AATTCATTAA CAGGCAGAAG
CAGGATGAAA TCGCCGAGAA ATACATCAAG CTGCTTGGCA TTCGCACGCC ATCGCCCGAC
CAGCTGGTCA AAAACTTAAG TGGCGGCAAT CAACAAAAGG TCATCTTGGC CCGCTGGCTC
GTTACGCAGC CACGCCTGCT GATTTTGGAT GAGCCGACAC GGGGCATTGA TGTCGGTGCC
AAGGCTGAGA TTCAAAAGCT GGTACTGGAA CTCGTCAAGG ATGGCATGTC AATCGTCTTT
ATTTCCTCTG AGTTGGAAGA GGTGGTACGT ATCAGCGACC GGATTGTGGT CTTGCGCGAC
CGCGCGAAAG TGGCCGACTA CGACCACGCG GTGAGTGATC GGACGCTTAT TCAAACGATG
GCAGGTGAGG CATGA
 
Protein sequence
MADSQSLITV RGISKAFPGV VALERVDFTL RKGEIHALMG ENGAGKSTLI KILTGVEQPD 
TGSIELEHTL IRIRSPQHAQ QLGISTVYQE INLCTNISVA ENIMLGHEPR RFGGIHWKKL
NEMARQALLR LGIELDVTQP LGMYSIAIQQ MVAITRALEI ASAKVLILDE PTSSLDANET
ARLFAVMRAL KQAGIGIVFV THFLDQVYEI VDRVTILRNG SFVGSYDIAD LPRVELVAKM
LGRVVGELQA LAQDKARGER QPDERRLIEA AGLGLSGMLE PLDLAINVGE VLGIAGLLGS
GRSELASLLF GLTSPDSGTL LIDGQSVTRF SPQESIKRGV ALCPEDRKAE GIVGDLSIRE
NIILGLQGRY GWFKFINRQK QDEIAEKYIK LLGIRTPSPD QLVKNLSGGN QQKVILARWL
VTQPRLLILD EPTRGIDVGA KAEIQKLVLE LVKDGMSIVF ISSELEEVVR ISDRIVVLRD
RAKVADYDHA VSDRTLIQTM AGEA