Gene Ent638_2036 details


Gene Information

Locus tag: Ent638_2036
Symbol:
ID: 5113452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start (bp): 2209100
End (bp): 2210308
Gene length: 1209 bp
Protein length: 402 aa
Translation table: 11
GC content: 57%
IMG OID: 640492224
Product: major facilitator transporter
Protein accession: YP_001176763
Protein GI: 146311689
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
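
The coordinate and length fields above are consistent with one another. A minimal sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(2209100, 2210308)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```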


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.296988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.627164
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTGTA CGGAATACCC TGGCGCGTCA GCACCGCTAA CCTCTCGCAT GATTTTACTC 
TTTGCGACAC TCTGCGCCTT TGCCGTGGCG AACGTTTACA TGACGCAGCC GCTGCTGGAC
CAAATAGCCC TCTCGCTCGG CGAGAATGAA TCCAGAATGG GCATGATCAT CACCGCGACA
CAAACAGGTT ACGCGCTGGG GCTGATGCTG CTGGTTCCGC TTGGTGACCT GATTAACCGC
AAACGGCTGG TCACGCTCAT GTTACTGGCA AGCAGTGGGT TGTTGCTCGC TGCGTCCATG
GCCTCTTCTC TTTACGGGTT AAGCGGCATG CTGAGTCTGG TGGGCGCGAT GGCGGTAGTC
GTGCAAATCA TCGTGGCGTT TGCCGCCAGT CTGTCGGCCC CGGAAAAGCG CGGTCAGGTT
ACGGGCATCG TCACCAGCGG TGTGGTGATG GGTATTTTGC TCGCGCGTCT GGTCTCGGGC
TTTTTGGCGC AGTGGGCGGG CTGGAGAGTC GCGATTATGG TCTCGGCCGG GGCGATGTTC
CTGATGGCGT TGCTTTTCAT CCGTACTGCG CCGGATGAGC GTAAACAAGA CTCTACGCAA
TCGTACGGGC AACTGATGCT GTCGGTGTTT TCCCTGTGGC GTGAAATCCC CGCGCTGCGC
TCACGCGGCA TTCTGGCTTT GCTGATATTT ATGAATTTTA GCGTGCTATG GACATCCCTG
GTTTTCCCGC TCAGCCATGC GCCGTTTAGT CTGACCACTG CACAGATTGG TCTGTTCGGT
CTGGCAGGTA TTGCTGGCGC ACTGGCGGCA AGGCAAGCCG GAACCTTGGC CGATCGAGGC
CACGGACAAC ACGTCACCGG ATTCGCGCTG ATGTTGCTAT TGATCTCATG GATGGCGATG
GCATGGGGCG GCTCATCGCT AATTGCGCTG ACGTTAGGCA TCATCCTGCT GGACTTTGCC
GTGCAGGCGG TTCACGTCAC TAGCCAGAGC ATGATTTTTG CAACAAGACC GCAAGCCACC
AGCCGACTGG TGGCCGCTTA TATGTTTTTC TACTCTGTCG GCAGCGCCAT CGGTGCCCTG
CTCGCCACTC ACGTATGGAG TCAATTCGGC TGGACGGGCG TGTGTCTTTT GGGCGCGACA
ATTAGCGTAC TGGCGTTGCT TTACTGGCTG CTGATTGACA GGAGTCGGCA CAGCCAGACA
GCTGATTAA
 
Protein sequence
MTCTEYPGAS APLTSRMILL FATLCAFAVA NVYMTQPLLD QIALSLGENE SRMGMIITAT 
QTGYALGLML LVPLGDLINR KRLVTLMLLA SSGLLLAASM ASSLYGLSGM LSLVGAMAVV
VQIIVAFAAS LSAPEKRGQV TGIVTSGVVM GILLARLVSG FLAQWAGWRV AIMVSAGAMF
LMALLFIRTA PDERKQDSTQ SYGQLMLSVF SLWREIPALR SRGILALLIF MNFSVLWTSL
VFPLSHAPFS LTTAQIGLFG LAGIAGALAA RQAGTLADRG HGQHVTGFAL MLLLISWMAM
AWGGSSLIAL TLGIILLDFA VQAVHVTSQS MIFATRPQAT SRLVAAYMFF YSVGSAIGAL
LATHVWSQFG WTGVCLLGAT ISVLALLYWL LIDRSRHSQT AD
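
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one possible way to do this, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the final three codons from the record.
first_row = "ATGACCTGTA CGGAATACCC TGGCGCGTCA GCACCGCTAA CCTCTCGCAT GATTTTACTC"
print(translate(first_row))    # MTCTEYPGASAPLTSRMILL
print(translate("GCTGATTAA"))  # AD
```

This sketch does not treat alternative start codons specially, which does not matter here because the gene begins with ATG.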