Gene Haur_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1033 
Symbol 
ID5732937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1178881 
End bp1180164 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content52% 
IMG OID641278168 
ProductABC transporter related 
Protein accessionYP_001543809 
Protein GI159897562 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGA TTGCGATTCA GGTTGAGCAT TTAAGTAAGC AATTTCGCAT TGGGGCGAAT 
GCATTTCAAA AAACCTTTCG CGAAACAGTG TCGGATATGA TGTTAGCGCC AGCCCGCCGC
TTGCGTTCAG CCATGCGTGG TCAAGGCAGC CAAGCTGCGA CCAAGGAATT TTGGGCGCTG
CGCGATGTTT CATTCGATGT TAAGCAGGGC GAAGTCGTTG GGATTATCGG CCACAACGGC
GCGGGCAAAA GTACCTTATT GAAAGTGCTT TCGCGGATTA CCGAGCCAAC CACTGGCTCA
GCCTTGATTC GTGGGCGGGT TGGCACGCTG CTTGAAGTTG GCACGGGTTT TCATCCCGAC
CTCACAGGTC GCGAAAATAT GTATCTCAAC GGAGCGATTT TGGGCATGAG CCGCGCCGAA
ATTAACCGTA AGTTCGATGA AATTGTGGCC TTTGCTGAGG TCGAGCAATT TATCGACACC
ATGGTTAAAC ATTATTCCAG CGGCATGTAT TTGCGCTTAG CCTTTGCGGT GGCCGCCCAC
CTTGAGCCAG AAATTCTGAT TGTCGATGAG GTTCTGGCGG TTGGCGATGC TCAATTTCAA
AAGAAATGCT TGGGCAAAAT GGGCGAAGTT GCCCAGAATG GCCGGACTGT GCTCTTTGTA
AGCCACAATA TGGCGGCAAT TCGCAGCTTA TGTCAACGAG TGGTTTGGCT CAACCAAGGC
ACTGTGCTCA AAGATGGCCC TTCGGCAGCA ATTGTCAACG AATATTTGAT GCAAACCGTC
ACTTCAAGCA GCTCTCGCGT CGATCTGCGC GAGGCAACTC GCCATTATGA TTATGGTAAG
CGCTTTAAGA TTAATGACCT GACCTTTAAC CACGGCGAGC CAATTTTGCA TGGCGAGGTG
CTCAACGTCA GCTTCAACTA CGAAGCCTAT GCCGATGTCT ATGGCGTGTC GTTTGGCTAT
GGGTTTTCCT CACTCGAAGG CACGCGCCTA ATGACGGTCG ATAGCGATTT GAATGCACCA
CGCTACCTTA TTAAGCGCGG CCAGCACGGC CAACTCACCA GTACCCTCGA TACGCTCAAC
TTGCAACCTG GCATCTATTT GCTTGATGTT GGGGTGCGTT CAGGTGATGG TAGTGCGTTG
GATTACCTGC CTGGATGTGC TCAGGTTGAA ATTTTGCCTG GCCCGACTAC CCCAGCTTCA
ATGAGCCGCT TGGATTATGC CGGTAATGTA CGGCTTGGCG GGCAGTGGCA ATGGCCTGAG
ACGGCCCAAG AGGATCGCGA TTAA
 
Protein sequence
MSEIAIQVEH LSKQFRIGAN AFQKTFRETV SDMMLAPARR LRSAMRGQGS QAATKEFWAL 
RDVSFDVKQG EVVGIIGHNG AGKSTLLKVL SRITEPTTGS ALIRGRVGTL LEVGTGFHPD
LTGRENMYLN GAILGMSRAE INRKFDEIVA FAEVEQFIDT MVKHYSSGMY LRLAFAVAAH
LEPEILIVDE VLAVGDAQFQ KKCLGKMGEV AQNGRTVLFV SHNMAAIRSL CQRVVWLNQG
TVLKDGPSAA IVNEYLMQTV TSSSSRVDLR EATRHYDYGK RFKINDLTFN HGEPILHGEV
LNVSFNYEAY ADVYGVSFGY GFSSLEGTRL MTVDSDLNAP RYLIKRGQHG QLTSTLDTLN
LQPGIYLLDV GVRSGDGSAL DYLPGCAQVE ILPGPTTPAS MSRLDYAGNV RLGGQWQWPE
TAQEDRD