Gene Gura_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_0789 
Symbol 
ID5165297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp940606 
End bp941844 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content60% 
IMG OID640548287 
Productmajor facilitator transporter 
Protein accessionYP_001229570 
Protein GI148262864 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCA TCATGTTGAC CGTGCTGATC GACATGGTGT CTATCGGTTT GATCATTCCC 
GTGTTGCCGT CGTTGGTCGG AAGTTTCACC GACTCACAAG CAAATCAGGC GTTCTGGTAT
GGCGTCGTGG TGTTCGCGTT CGGAATCGCG AATTTTTTCG CTTCGCCGAT CCTCGGTGCG
CTGTCCGACG CTTACGGTCG CCGCCCGTTG CTCTTGCTCG GTTTTTGCGG ACTCGGCCTC
AATTTTTTCG CAACAGGGCT CTCCACGGCG TTGTGGATGC TGATTGCGGT GCGGCTGGTG
GGCGGTGCGA TGCAGGCCAA TGCGGCCGTG GCTAACGCGT ATGTGGCGGA TATCACCGTT
CCCGAAGAGC GTGCCAGGCG TTTCGGCATG TTGGGCGCGA TGTTCGGCGT TGGCTTCATC
GTCGGGCCGG TGATGGGCGG GCTGCTGGGC GCAATCACCA TACAGCTCCC GTTTTTCGTC
GCCGGCGCCT TTGCAATGAT TAACTGGCTC TACGGCTATT TTGTGTTACC CGAGTCGCTC
CCTGCCGAGC GCCGGCGCCC ATTCCACTGG CGGATGGCAA ACCCGCTCGT GTCGCTACGC
GCGCTGACCC GGCTGAGCGG TGTCGGCCGA TTGGTCGCCG TGGTTGCGTT GAGCGGACTT
GCCCAATTCG TGCTGTTCAC CAGTTGGGTG TTGTACACGA CCTTCAAGTT CGGCTGGGGA
CCGCGTGAAA ACGGCTGGTC GCTCGCAGCG GTCGGCATCA TGTCGTTGGT CGTGCAGGGT
TTTCTGCTCG GACGGCTGCT GAAACGCTTT AGTCCGCGAC GCCTTGTGGT TGCCGGACTG
GCGTCGTCGT CGATCGCCTA CATATTGTGG GGCATAGCCA ACCAGGGCTG GATGATGTAC
GCAGTAATCT TCCTGAATCT GCTTAGCTAT ACGGTTACTG CGTCGCTGCA AAGCATAATT
TCCAGCGCCG CCGACTCCCA AAGCCAGGGG CAGGCGTTGG GGGCGGTCAA CTCCCTGAAC
AGCCTGATGG CGGTAGTGGC CCCCTTGTTC AGCACGCCGC TGCTTGCGAC GGTTTCCCAT
TTGCAGCGCG GCGATTGGCG CATCGGCGCG CCGTTCTATT TTTGCGCCCT GCTTCAAGCC
GCATCGCTGG CCTTGGCGTA TTTTCATTTC CGCAGCGAGC ACCATGCGAC GCCCGCGACG
GCGTCAGAAG TGCAGAGAGG CAGTGGGGGC AACCCTTGA
 
Protein sequence
MPFIMLTVLI DMVSIGLIIP VLPSLVGSFT DSQANQAFWY GVVVFAFGIA NFFASPILGA 
LSDAYGRRPL LLLGFCGLGL NFFATGLSTA LWMLIAVRLV GGAMQANAAV ANAYVADITV
PEERARRFGM LGAMFGVGFI VGPVMGGLLG AITIQLPFFV AGAFAMINWL YGYFVLPESL
PAERRRPFHW RMANPLVSLR ALTRLSGVGR LVAVVALSGL AQFVLFTSWV LYTTFKFGWG
PRENGWSLAA VGIMSLVVQG FLLGRLLKRF SPRRLVVAGL ASSSIAYILW GIANQGWMMY
AVIFLNLLSY TVTASLQSII SSAADSQSQG QALGAVNSLN SLMAVVAPLF STPLLATVSH
LQRGDWRIGA PFYFCALLQA ASLALAYFHF RSEHHATPAT ASEVQRGSGG NP