Gene Ent638_3660 details

Gene Information

Locus tag: Ent638_3660
Symbol:
ID: 5111908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 3968202
End bp: 3969689
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 58%
IMG OID: 640493865
Product: putative sialic acid transporter
Protein accession: YP_001178368
Protein GI: 146313294
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00891] putative sialic acid transporter
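
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; the variable and function names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3968202
end_bp = 3969689
gene_length_bp = 1488
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
residues = codons - 1


def record_is_consistent() -> bool:
    """Return True if the span, reading frame, and protein length agree."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and residues == protein_length_aa
    )


print(span, codons, residues, record_is_consistent())
```

Under those assumptions the span is 1488 bp, which is 496 codons, leaving 495 residues once the stop codon is excluded; this matches the reported gene and protein lengths.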


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.723795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.17962
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATTC CTACCCAAAG CATCCCGTGG TACCGCCATC TCAGCCGCCC GCAGTGGCGC 
GCCTTCTCCG CCGCATGGTT GGGATATCTG CTGGATGGTT TCGATTTTGT ATTGATAGCG
CTGGTGTTAA CGGAAGTTAA AACGGAATTT GGACTGACAA CGGTCGAGGC GGCTAGCCTC
ATATCCGCTG CCTTTATCTC CCGATGGTTT GGCGGCCTGC TGCTTGGCGC TATGGGCGAT
CGCTACGGGC GGAAACTGGC GATGATCACC AGTATCGTGC TGTTTTCGTG CGGGACGCTG
GCGTGCGGTT TTGCGCCAGG CTACATCACG ATGTTTATTG CCCGCATGGT GATCGGCATG
GGCATGGCGG GCGAATACGG CTCCAGCGCC ACCTACGTGA TTGAAAGCTG GCCAAAGCAC
CTGCGCAACA AAGCCAGCGG CTTTTTGATC TCCGGCTTCT CTGTGGGTGC CGTTATCGCG
GCGCAGATCT ACAGCCTCGT CGTTCCCGTC TGGGGCTGGC GCGCGCTGTT TTTTATCGGC
ATCCTGCCGA TCATTTTTGC CCTGTGGCTG CGTAAAAATA TTCCGGAAGC GGAAGACTGG
AAAGCCAGAC ACGAAGGTAA AGCGCCCGTG CGCACAATGG TGGATATTCT CTATCGTGGC
AAACATCGGG TGATCAATAT CGCTATGACG CTCTTCGCGG GAACGGCGCT GTGGTTCTGC
TTTGCTGGGG AACTCAACAA TGCCGGTCTG GTGATGATTC TTGGCCTGCT GTGCGCCTTT
GTGTTCATCA GCTTTATGGT GCAAAGCAGC GGTAAGCGCT GGCCGACGGG CGTCACTCTG
ATGGTGGTCG TGCTGTTTGC GTTCCTTTAT TCGTGGCCGA TTCAGGCGCT GCTGCCGACC
TATCTGAAAA CGGATTTGGC GTACGATCCG GCGATGGTGG CGCGCGTGCT GTTCTTCAGC
GGCTTTGGCG CAGCGGTTGG CTGCTGTGTC GGCGGTTTTC TGGGCGACTG GCTGGGGACG
CGCAAAGCGT ACGTTTGCAG CCTGCTGGCC TCGCAGCTCC TTATCATTCC GGTCTTTGCG
ATTGGCGGGT CCAATATCTG GGTTCTCGGT TTTCTGCTCT TCTTCCAGCA GATGCTCGGG
CAGGGGATTT CCGGAATTCT GCCGAAGCTT ATCGGCGGCT ACTTCGATAC CGATCAGCGT
GCCGCCGGGC TGGGCTTTAC TTACAACGTC GGCGCATTGG GTGGGGCGAT TGCGCCGGTG
ATTGGGGCGC TGCTGGCACA ACGTCTGGAG CTGGGCACCG CGCTCGGTTC ACTCTCGTTC
GGCTTAACCT TTGTCGTCAT CCTGTTGATT GGCCTGGATA TGCCGTCTCG CGTGCAGCGC
TGGATCCGCC CCGAGGCACT TCGCACGCAC GATGCGATTG ACGGTAAACC GTTCAGCGGG
GCCGTTCCTC TAGGCAGCAT CAAAAGCGAG CGTGAGGCGG TGAAATGA
 
Protein sequence
MRIPTQSIPW YRHLSRPQWR AFSAAWLGYL LDGFDFVLIA LVLTEVKTEF GLTTVEAASL 
ISAAFISRWF GGLLLGAMGD RYGRKLAMIT SIVLFSCGTL ACGFAPGYIT MFIARMVIGM
GMAGEYGSSA TYVIESWPKH LRNKASGFLI SGFSVGAVIA AQIYSLVVPV WGWRALFFIG
ILPIIFALWL RKNIPEAEDW KARHEGKAPV RTMVDILYRG KHRVINIAMT LFAGTALWFC
FAGELNNAGL VMILGLLCAF VFISFMVQSS GKRWPTGVTL MVVVLFAFLY SWPIQALLPT
YLKTDLAYDP AMVARVLFFS GFGAAVGCCV GGFLGDWLGT RKAYVCSLLA SQLLIIPVFA
IGGSNIWVLG FLLFFQQMLG QGISGILPKL IGGYFDTDQR AAGLGFTYNV GALGGAIAPV
IGALLAQRLE LGTALGSLSF GLTFVVILLI GLDMPSRVQR WIRPEALRTH DAIDGKPFSG
AVPLGSIKSE REAVK
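
The sequences above are laid out in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a stop-codon check. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop-codon set is the one used by translation table 11.

```python
# Stop codons recognised under translation table 11 (bacterial, archaeal, plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Example using the first three blocks of the gene sequence above.
first_blocks = clean_sequence("ATGCGTATTC CTACCCAAAG CATCCCGTGG")
print(len(first_blocks), first_blocks[:3])
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.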