Gene Ent638_3996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3996 
Symbol 
ID5110461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4332644 
End bp4333894 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content56% 
IMG OID640494214 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001178702 
Protein GI146313628 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0920288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.168346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTGG CGAAAGCATC GGTGTGGACC GCCGCGTCCA CGCTCGTAAA GATTGGCACC 
GGGCTGTTAG TGGTTAAACT TCTGGCCGTC TCGTACGGCC CCTCAGGTGT TGGTCAGGCC
GGCAATTTCC GCCAGCTTGT GACCGTGCTT GGGGTTCTCG CAGGTGCCGG TATTTTCAAC
GGCGTGACCA AATACGTTGC ACAGCATCAT GACGATACCG CATCGCTTCG CAAGGTGATC
GGTACCTCGT CCGCGATGGT ATTGGGTTTC TCGACGTTGC TGGCGGTTGT ATTTCTTCTT
GCGGCGGCCC CAATCAGTCA GGGGCTTTTC GGCAACACCC ATTATCAGGG CCTGGTGCGC
CTGGTTGCGC TGGTGCAGAT GGGCATTGCC TGGGCCAACC TGCTGTTAGC CTTAATGAAG
GGTTTTCGGG ATGCGGCCGG GAACGCGCTG GCGCTGATTG CGGGCAGTTT TATTGGCGTC
ATCGCCTATT ACTTTTGCTA TCGTCTGGGC GGCTACGAAG GCGCATTGCT TGGCCTGGCG
CTGGTTCCCG CGTTGGTCGT GATCCCCGCT GCGTTGATGT TGATGCGCAG ACGCACGATT
CCGCTAAGCT ATCTCAAACC GCAGTGGGAC AAAATTCTGG CGGGGCAATT GGGGAAATTT
ACCCTGATGG CACTCATCAC ATCCGTCACG TTACCCGTGG CCTACGTGAT GATGCGAAAC
CTGCTGGCGG CGCACTACAG CTGGGATGAA GTGGGGATCT GGCAAGGTGT GAGCAGTATT
TCTGACGCCT ATCTCCAGTT TATCACTGCG TCTTTTAGCG TTTATTTGCT GCCAACCTTG
TCGCGCCTGG TGTCAAAACA GGACATTACC CGCGAGATTG GCCGCTCTCT GCGTTTTGTT
CTTCCTGCCG TGGCTGTCGC GAGTTTGACC GTCTGGTTGC TGCGAGATGT AGCCATCTGG
CTGCTGTTCT CGGCAAAATT TACCGCGATG CGCGATCTGT TTGCCTGGCA ACTGGTGGGC
GATGTACTGA AAGTGGGGGC TTACGTTTTT GGCTATCTGG TGATTGCTAA AGCGTCGCTG
CGCTTGTACA TCCTGGCGGA AATCAGCCAG TTTTCGCTCT TAACCGCTTT CTCTCTTTGG
CTGATCCCTG CGCACGGCGC GCTGGGGGCA TCACAGGCCT ATATGGCGAC TTACATCGTT
TATTTCGCTG CCTGTTGCGG CGTATTTTTA CTTTGGCGTA AACGCGCATG A
 
Protein sequence
MSLAKASVWT AASTLVKIGT GLLVVKLLAV SYGPSGVGQA GNFRQLVTVL GVLAGAGIFN 
GVTKYVAQHH DDTASLRKVI GTSSAMVLGF STLLAVVFLL AAAPISQGLF GNTHYQGLVR
LVALVQMGIA WANLLLALMK GFRDAAGNAL ALIAGSFIGV IAYYFCYRLG GYEGALLGLA
LVPALVVIPA ALMLMRRRTI PLSYLKPQWD KILAGQLGKF TLMALITSVT LPVAYVMMRN
LLAAHYSWDE VGIWQGVSSI SDAYLQFITA SFSVYLLPTL SRLVSKQDIT REIGRSLRFV
LPAVAVASLT VWLLRDVAIW LLFSAKFTAM RDLFAWQLVG DVLKVGAYVF GYLVIAKASL
RLYILAEISQ FSLLTAFSLW LIPAHGALGA SQAYMATYIV YFAACCGVFL LWRKRA