Gene Haur_0796 details


Gene Information

Locus tag: Haur_0796
Symbol:
ID: 5732681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 899040
End bp: 900215
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 53%
IMG OID: 641277927
Product: major facilitator transporter
Protein accession: YP_001543572
Protein GI: 159897325
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
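
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 899040
end_bp = 900215
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is counted in the
# gene length but does not contribute a residue to the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)
print(span_bp % 3 == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.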


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGCT CTCCACTTTT GACCATTTTT TTGATCGCGT TTGTTGGAAC CATGAGCTAT 
GGTGTGGTGA TTCCGATCAC GCCGTTTTAT GCGCAGGAGT TTGGGGCTTC CGAGGTTCAG
GTGGGCATGA TTGTAGGCAG CTATGCCTTG ATGCAATTTA TCTTTGCCCC GATCCTTGGC
CAATTATCCG ACCGCTATGG TCGTCGCCCA CTGCTGATTT TAAGTTTGAT TGGCACGGTT
TGTAGTTTGT TGCTATTTGG TTTTGCCAAT AGCCTGATTT GGCTGTTCGT CGGGCGCATG
TTCGATGGCG CAACTGGCGG TAACATCTCG ATTGCCCAAG CCTATGTTAG CGATATCACC
ACCGACAAAG ATCGTGCTCG CGGGATGGGC ATGGTTGGGG CGGCACTTGG CTTAGGCTTT
ATCGCTGGCC CAGCCATCGG CGCGTTGCTC AGCAAAGATG GCAATTATCA GTTGCCAATT
TTCGTAGCCG CAGGCATTGC AGTGCTCAGC CTGATTTTAA CGATTGTGGT ATTGCCTGAG
CCAGAGCGCC ATGCACCTCA ACAAGGCCGT ACTTTTAACC CAATGAAACT GCTGGCGGCA
GTTCGCAAGC CCAATGTTGG CCGTTTGCTC AGTATTACCT TGTTGATCAA CTTGGCATTT
GTGGCCTTTG AAACAACTTT TGCCTTGTTT GCGGCGCGAC GGTTGGAGTT TGGCTCGCAT
CAAACAGGCT ATACTTTGGC CGGGGTTGGG ATTGTGGTCG CGATTGTGCA AGGCGGCTTA
ATTCGCCGTT TGGCGGCGCG GTTTGGCGAA GCAACCCTGA TTGTGTCTGG CTCGTTGCTG
CTCGCGCTTT CGTTGGCGGG CTTGGGCTTT ATTCAAAATG TGTGGCATTT GGTGGCAATT
TGTATTGTGC TGGCAGTTGG CGAGGGCTTG CTCACGCCAT CGCTTTCGTC GTTGGTCAGC
CGCAATTCAC CTGCTAGCGA GCGCGGCGAG AATATGGGCT TGTATCAGTC GATGAGCAGT
TTGGCGCGGA TTTTTGCCCC GCTCTATGCC ACCTGGATGC TCTCGAACGT TGGCGAAGCC
TCGCCCTACC TGATGGGCAG CGTGTTGGTT GTGGCAGGCG CATTAATTGC GGTTGGCTTG
CCTAGCCCTG AACCGCAAGC CCAGCCAGCG CATTAG
 
Protein sequence
MKRSPLLTIF LIAFVGTMSY GVVIPITPFY AQEFGASEVQ VGMIVGSYAL MQFIFAPILG 
QLSDRYGRRP LLILSLIGTV CSLLLFGFAN SLIWLFVGRM FDGATGGNIS IAQAYVSDIT
TDKDRARGMG MVGAALGLGF IAGPAIGALL SKDGNYQLPI FVAAGIAVLS LILTIVVLPE
PERHAPQQGR TFNPMKLLAA VRKPNVGRLL SITLLINLAF VAFETTFALF AARRLEFGSH
QTGYTLAGVG IVVAIVQGGL IRRLAARFGE ATLIVSGSLL LALSLAGLGF IQNVWHLVAI
CIVLAVGEGL LTPSLSSLVS RNSPASERGE NMGLYQSMSS LARIFAPLYA TWMLSNVGEA
SPYLMGSVLV VAGALIAVGL PSPEPQAQPA H
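
The gene and protein sequences above are related through translation table 11. As a rough sketch of that relationship, the snippet below cleans a sequence as formatted on this page (blocks of ten separated by spaces), translates it, and computes GC content. The function names `clean`, `translate`, and `gc_content` are hypothetical helpers written for illustration. The codon table is built from the standard NCBI ordering; table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G), shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a cleaned or display-formatted DNA string, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as shown on this page.
first_line = ("ATGAAACGCT CTCCACTTTT GACCATTTTT "
              "TTGATCGCGT TTGTTGGAAC CATGAGCTAT")
print(translate(first_line))  # MKRSPLLTIFLIAFVGTMSY
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` should give a value that can be compared with the GC content field of the record.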