Gene Ent638_4053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4053 
Symbol 
ID5110807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4406607 
End bp4407911 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content59% 
IMG OID640494278 
Productsodium/sulphate symporter 
Protein accessionYP_001178759 
Protein GI146313685 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCT GGTTTACTCA CCCTCTTTTT CTGCCCTCGC TCATTGTTGG CATCACCATC 
GTGCTGTGGG CGACCTCGCT CCTGCCGGAA TTTATCACCG CGCTGCTGTT CTTTACGGCA
GCGATGATCG CCAAAATTGC CCCTGCGGAT GTCATTTTCG GAGGCTTTGC ATCATCGGCA
TTCTGGCTGG TCTTCAGCGG ATTTGTGCTC GGTGTGGCGA TTCGCAAAAC CGGCCTGGCG
GACAGGGCGG CGCGAGCGCT ATCGGCAAAA CTGACCGATT CGTGGCTGTT GATGGTGGCA
ACTGTGGTGC TGCTGAGTTA TGCCCTGGCG TTTGTGATGC CGTCGAACAT GGGGCGCATC
GCGCTGCTGA TGCCGATTGT GGCTGCGATG GCTAAACGCG CCAGCATCGC GGACGGCTCC
CGTGCGTGGT TTGGTCTGGC GCTGGCGGTG GGTTTCGGGA CGTTCCAGCT TTCCGCGACT
ATTTTGCCCG CTAATGTGCC CAATCTGGTG ATGAGTGGCG CGGCGGAAGG TTCATACGGC
ATCCATCTGA ACTACGTGCC TTATCTCCTG CTGCACACGC CGGTGCTCGG CATTCTGAAA
GGACTGATTC TGATTGGGCT GATCTGCTGG CTGTTCCCCG GCTCACCGAA ACATCCGCAG
GAGGTTTCTG CGCCGGAACC GATGGGACGC GATGAGAAAC GGCTCGCCTG GCTTTTGGCG
GTAGTGCTGG TGATGTGGGT GACGGAGAGT TGGCACGGAA TTGGCCCCGC GTGGACCGGG
CTGGCGGCAT CGCTGGTGGT GATGCTCCCG CGCATCGGCT TTATTACTGG CGAGGAGTTT
TCAGCGGGCG TGAATATGCG CACCTGTATC TACGTGGCGG GTATTTTGGG GCTGGCTATC
ACCGTCACGC AGACGGGAAT TGGGAGCGCC GTAGGAGAGG CGCTGCTTCA CCTGATGCCG
CTGGACGCGG ATAAGCCTTT CACCAGTTTC CTGGCGCTCA CGGGGATCAC CACGGCGCTT
AACTTCATCA TGACCGCCAA CGGCGTTCCG GCGCTGTACA CCACGCTGGC GCAGAGTTTT
TCCGACGCGA CCGGTTTCCC GCTGCTGTCG GTGATTATGA TTCAGGTGTT GGGTTATTCC
ACGCCGCTGT TGCCGTATCA GGCGTCGCCG ATTGTGGTAG CGATGGGACT TGGGAAAGTG
CCTGCAAAGG CGGGAATGAT GCTGTGTCTG GCGCTGGCAG TGGCGACCTA TGTGGTCCTG
TTGCCGCTCG ATTACTTATG GTTTAGCGTG CTGGGGAAAT TATAG
 
Protein sequence
MSLWFTHPLF LPSLIVGITI VLWATSLLPE FITALLFFTA AMIAKIAPAD VIFGGFASSA 
FWLVFSGFVL GVAIRKTGLA DRAARALSAK LTDSWLLMVA TVVLLSYALA FVMPSNMGRI
ALLMPIVAAM AKRASIADGS RAWFGLALAV GFGTFQLSAT ILPANVPNLV MSGAAEGSYG
IHLNYVPYLL LHTPVLGILK GLILIGLICW LFPGSPKHPQ EVSAPEPMGR DEKRLAWLLA
VVLVMWVTES WHGIGPAWTG LAASLVVMLP RIGFITGEEF SAGVNMRTCI YVAGILGLAI
TVTQTGIGSA VGEALLHLMP LDADKPFTSF LALTGITTAL NFIMTANGVP ALYTTLAQSF
SDATGFPLLS VIMIQVLGYS TPLLPYQASP IVVAMGLGKV PAKAGMMLCL ALAVATYVVL
LPLDYLWFSV LGKL