Gene Ent638_2050 details


Gene Information

Locus tag: Ent638_2050
Symbol:
ID: 5113466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2222846
End bp: 2224063
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 52%
IMG OID: 640492238
Product: MFS transport protein AraJ
Protein accession: YP_001176777
Protein GI: 146311703
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
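
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 2222846
end_bp = 2224063

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1218 405
```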


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATA GCTGTTGCAA ATTTTTAACA AATGGCTGTG GTATGAAAAA AACCGTTTTC 
TCGTTAGCAC TGGGCACGTT TGGTCTGGGA ATGGCTGAAT TTGGCATTAT GGGTGTGTTA
ACTGAGCTGG CCCGTGACAC CGGTATCACG ATCCCTTCCG CCGGAAGCAT GATTTCGTAT
TACGCGTTCG GGGTGGTTAT TGGCGCACCG ATCATTGCGC TTTTTTCAGG CAAGTTCTCG
TTAAAAACGA TCCTGCTCTT CCTGGTCGCG CTGTGTGTCG TTGGGAATGT CATTTTTACT
TTTTCCACCT CCTATCTCTG GCTGGCCATC GGGCGACTGG TGTCCGGTTT TCCCCACGGT
GCATTTTTCG GCGTCGGGGC GATCATCCTG TCTAAAATTG CGCCACCGGG CAAAGTGACC
CTTGCCGTAG CAGGAATGAT TGCCGGCATG ACGGTCGCGA ATCTGGTTGG GGTTCCGCTG
GGGACCTGGC TCGGACATGA ATACAACTGG CGATACACTT TTTTCCTGAT TGCCGCGTTT
GATGCGCTGG TGATCCTGTC GATTATTTTC TGGGTTCCGA CGCTGTACGA CAAATCGGAT
GCCAGACTGA TCGAGCAATT CCACTTTTTG AAAAAGCCGG AACCCTGGTT AATTTTTGCC
GCCACCATGT CTGGCAATGC GGGCGTGTTC GCGTGGTTCA GCTTTGTGAA GCCGTTTATG
GTGAATGTTT CCGGATTTTC GGAAGCCTTC ATGACCGTCA TTATGATGCT GATGGGGCTG
GGAATGGTGT TGGGGAATAT GCTGAGCGGC AAATTATCCG GACGCTACAG TCCGCTGCGC
ATTGCCGCTA CCACCGATCT GGTCATCGTC CTGTCGCTGG TGGCGCTCTT TGCCTTTGGC
GAGCACAAAT CGGCCTCGCT GGTGATGGGC TTTATCTGCT GTGCAGGCCT TTTTGCGCTC
TCTGCGCCAC TGCAAATTCT GTTGCTGCAG AATGCGAAAG GCGGCGAATT ACTGGGAGCA
GCAGGGGGGC AGGTGGCGTT TAATCTCGGC AGCGCCATCG GTGCCTATTT TGGTGGAATG
ATGATCACAC TTGGCTACAG CTGGAGCTAC GTTACGCTCC CCGCCGCCAT TCTGTCGTTT
GCGGCCATGT CTTCGTTACT GATGTACGGC TATAAAAAAG GCAAACGTAA TCAGGCGAAT
GACAGCGCGT TGGCGTAA
 
Protein sequence
MNNSCCKFLT NGCGMKKTVF SLALGTFGLG MAEFGIMGVL TELARDTGIT IPSAGSMISY 
YAFGVVIGAP IIALFSGKFS LKTILLFLVA LCVVGNVIFT FSTSYLWLAI GRLVSGFPHG
AFFGVGAIIL SKIAPPGKVT LAVAGMIAGM TVANLVGVPL GTWLGHEYNW RYTFFLIAAF
DALVILSIIF WVPTLYDKSD ARLIEQFHFL KKPEPWLIFA ATMSGNAGVF AWFSFVKPFM
VNVSGFSEAF MTVIMMLMGL GMVLGNMLSG KLSGRYSPLR IAATTDLVIV LSLVALFAFG
EHKSASLVMG FICCAGLFAL SAPLQILLLQ NAKGGELLGA AGGQVAFNLG SAIGAYFGGM
MITLGYSWSY VTLPAAILSF AAMSSLLMYG YKKGKRNQAN DSALA
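
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch of stripping that layout and translating the DNA into protein. It assumes the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as starts; this gene begins with ATG, so that does not matter here). The helper names `clean` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order: first, second, third base each cycle through TCAG.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases and last 18 bases of the gene sequence shown above.
first_line = "ATGAATAATA GCTGTTGCAA ATTTTTAACA AATGGCTGTG GTATGAAAAA AACCGTTTTC"
last_line = "GACAGCGCGT TGGCGTAA"

print(translate(first_line))  # prints: MNNSCCKFLTNGCGMKKTVF
print(translate(last_line))   # prints: DSALA
```

Both outputs match the corresponding ends of the protein sequence in the record. For real work, an established library such as Biopython is a safer choice than a hand-rolled table.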