Gene Haur_4661 details


Gene Information

Locus tag: Haur_4661
Symbol:
ID: 5736508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5958232
End bp: 5959596
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 53%
IMG OID: 641281825
Product: major facilitator transporter
Protein accession: YP_001547420
Protein GI: 159901173
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
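
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `lengths_from_coordinates` is hypothetical and not part of the record.

```python
def lengths_from_coordinates(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal
    stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1   # inclusive interval
    codon_count = gene_length // 3        # three bases per codon
    protein_length = codon_count - 1      # drop the stop codon
    return gene_length, protein_length


# Values copied from the record for Haur_4661.
print(lengths_from_coordinates(5958232, 5959596))  # prints: (1365, 454)
```

Both results agree with the Gene Length and Protein Length fields listed in the record.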


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.731341
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAAG CAACCAGAAA GCAACCGAGC GGCATGCTCG CATTTAGCAT TATGTGGTTT 
GGCCAAGTCG TTTCATTGCT TGGCAGCTCC ATGAGCAGCT TTGCCCTGAC GATTTGGGCT
TGGCAAATTA CAGGTCAAGC CACAGCCTTG GCGCTCGTAG GCTTTTTCTC GTTTGCCCCA
AGCATTATTG TTAGCCCCTT TGCCGGAGCC TTGGTCGATC GCTGGAATCG TAAGCTGGTG
CTGATTTTGA GCGATTTAGC CACAGGCTTA TCGACGATCG CTATTTTATT GCTCTACCAC
AACGATGTAC TGCAAATTTG GCATTTGTAT GTGGCTGGAG CATTTGCCAG CATTTTTCAA
TCGTTTCAAT GGCCAGCCTA TTCGGCGGCA GTTTCGACGA TGTTGCCCAA ACAGCACTAT
GCTCGCGCCA GCGGCATGAT GTCGATGGCC GAATCGGCAG CGGGAATTGT CGCGCCAGCC
CTAGCCGGCT TTTTGCTAAC CGTAATGGGC ATTGGCGGTA TCTTGATTAT TGATATTGTG
ACGTTTGTGT TTGCCGTCAG TGCAGTGCTC TTTGTTAATA TTCCCCAACC AACGCAGAGC
GAGGCGGGGG CGCAAGGCAA GGGCAGTTTA TGGAGCGAGG CAGGCTTTGG CTTTCGCTAT
ATTTTGGCAC GCCCCAGCCT CTTGGGCTTG CAACTGACCT TTTTTATGAT TAACTTCGTT
GGTTCGTTTG AAGCCACCAT GACCGCTCCC ATGATTCTGG CCCGCACCGA TAGTAACTCG
GCAATTATGG GCACGGTACA ATCGGCAATG GGCATTGGCG GAGTGATCGG CGGCTTGATC
CTGAGTGTCT GGGGCGGCCC CAAACGCAAA GTTCATGGCG TGCTCGGTGG AATGGCGCTC
TCCAGCTTTT TTGGCGGCAT TCTCATGGGC TTGGGCCAAA ATACGCTGGT TTGGTCGATT
GCGGGCTTTG GTTTGCTATT CGTGCTACCA ATGTTGAATG GCTCGAATCA AGCGATTTGG
CAAGCCAAAG TGCCACCCGA TATCCAAGGG CGGGTGTTCG CAGTGCGACG CATGATCGCC
CAAATTTCGG GGCCAATCGC AATTTTGATC GTTGGACCAT TGGCCGACAA AGTGTTTGAG
CCACGTATGG CGGTTGGCGG CGCTTGGGTC GATATGTTTG ACAGTTGGGT TGGCAGTGGC
AAAGGGGCGG GCATCGCCTT AATTATGGTC TTGAGTGGCA TTGTTGGCAT CGCCGTAGCC
GTGATCGCTT ATGGCGTGCG AGTCGTGCGC CACGCCGAGG ATCTAATTCC TGACCATCAA
GATAGCCCAA GTAGCAGCCC TGAACTGCAA GCCGAACCAG CCTAA
 
Protein sequence
MAEATRKQPS GMLAFSIMWF GQVVSLLGSS MSSFALTIWA WQITGQATAL ALVGFFSFAP 
SIIVSPFAGA LVDRWNRKLV LILSDLATGL STIAILLLYH NDVLQIWHLY VAGAFASIFQ
SFQWPAYSAA VSTMLPKQHY ARASGMMSMA ESAAGIVAPA LAGFLLTVMG IGGILIIDIV
TFVFAVSAVL FVNIPQPTQS EAGAQGKGSL WSEAGFGFRY ILARPSLLGL QLTFFMINFV
GSFEATMTAP MILARTDSNS AIMGTVQSAM GIGGVIGGLI LSVWGGPKRK VHGVLGGMAL
SSFFGGILMG LGQNTLVWSI AGFGLLFVLP MLNGSNQAIW QAKVPPDIQG RVFAVRRMIA
QISGPIAILI VGPLADKVFE PRMAVGGAWV DMFDSWVGSG KGAGIALIMV LSGIVGIAVA
VIAYGVRVVR HAEDLIPDHQ DSPSSSPELQ AEPA
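
The sequences above are printed in space-separated blocks of 10 residues, so they need to be cleaned before use. The following is a rough sketch of how the gene sequence could be joined, checked for GC content, and translated. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard genetic code, used here on the assumption that translation table 11 assigns the same amino acid to every codon and differs only in the start codons it permits.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCTGAAG CAACCAGAAA GCAACCGAGC"
dna = clean_sequence(first_blocks)
print(len(dna), translate(dna))  # prints: 30 MAEATRKQPS
```

The translated fragment matches the first block of the protein sequence listed in the record. Passing the full gene sequence through the same functions would allow the listed lengths and GC content to be checked.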