Gene Ent638_2302 details


Gene Information

Locus tag: Ent638_2302
Symbol:
ID: 5111123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2477535
End bp: 2479169
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 49%
IMG OID: 640492483
Product: extracellular solute-binding protein
Protein accession: YP_001177022
Protein GI: 146311948
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
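
The coordinate and length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the terminating stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length(2477535, 2479169)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields in the record.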


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00365913
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATCA TCACAAAAAA AAATCTGATT GCCGCGGGAA TTTTCACTGC GCTAATCGCG 
GGTAATATGG CAATGGCTGC CGAAGTGCCT GCCGGAATTG CATTGGCAGA AAAGCAAACC
ATGATTCGAA ACAACGGTGC AGAGCCACAG TCTCTTGACC CGAACAAAAT CGAAGGCGTT
CCGGAAGCAA ACATCAGCCG CGACCTGTTT GAAGGCCTGC TGATTACCTC AACGAAAGAC
GGTCACCCAA TCCCCGGTGT GGCTGAAAGC TGGGATAACA AAGACTTTAA AGTCTGGACA
TTCCACCTGC GTAAAGACGC AAAATGGTCC AACGGTGAAC CTGTCACCGC ACAAGATTTC
GTGTATAGCT GGCAGCGTCT GGTTGATCCT AATACCGCGT CCCCGTATGC GAGCTATCCG
CAATATGGTC ACATCCTGAA CGTTGATGAA ATCATCGACG GCAAAAAGAA ACCGTCTGAA
CTTGGCGTGA AAGCAGTTGA CGACCACACT CTGGAAGTCA CACTGAGCGA ACCGGTTCCC
TATTTCTACA AATTGCTGGT TAACCCGGCA ATGTCCCCGG TCAATAAAGC CGCCATCGAA
AAATTTGGCG AAAAATGGAC TCAGCCTGCG AACATCGTGA CCAACGGTGC GTATAAGCTG
AAAGACTGGG TTGTTAACGA ACGCATCGTG ATGGAGCGTA ATACCAACTA CTGGGATAAC
GCCAAAACCA TCGTTGATCA GATCACTTAT CTGCCTATTT CTTCAGAAGT CACTGACGTT
AACCGCTACC GCAGCGGTGA AATCGATATG ACCTATAACA ACCTGCCGAT TGAACTCTTC
CAGAAACTGA AGAAAGAGAT CCCGACAGAA GTTCACGTCG ATCCGTATCT GTGTACTTAT
TACTATGAAA TCAACAACCA GAAAGCACCG TTCAACGACG CGCGCGTTCG TACTGCGCTG
AAGCTGGGTC TTGATCGCGA TATCATCGTG AACAAAGTGA AAGCGCAGGG CGATCTTCCA
GCTTTCGGTT ATACCCCGCC ATATGCTGAC GGCGCGAAAC TGACCAAACC TGAGTGGTTC
ACCTGGACTC AGGAAAAACG TAACGAAGAA GCTAAAAAAC TGCTGGCTGA AGCGGGCTAC
ACCGCTGACA AACCACTGAA ATTTGACCTG TTGTATAACA CGTCAGATCT GCACAAAAAA
CTGGCGATCG CCGCGTCCTC CATCTGGAAG AAAAACCTGG GTGTTGACGT GAAGTTGGTT
AACCAGGAGT GGAAAACCTT CCTGGATACC CGTCATCAGG GCAACTATGA CGTGGCGCGT
GCAGGCTGGT GTGCGGATTA CAACGAACCA ACATCCTTCC TGAACACGAT GCTTTCTGAC
AGCTCGATGA ACACCGCGCA CTATAAGAGC CCGGCGTTTG ACGCGATCAT GAAAGAGTCT
CTGAAAGCCA CCGACGAAGC ACAGCGTTCC GCTCAGTACG ACAAAGCGGA GCAGCAGTTG
GGTAAAGACT CTGCGATTGT ACCGGTTTAC TACTATGTGA ACGCGCGTCT GGTGAAACCT
TGGGTTGGCG GTTATACCGG CAAAGACCCG ATGGACAACA TCTACACCAA AGATCTGTAC
ATCATTAAGC ATTAA
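
GC content is the percentage of G and C bases in a sequence. The sketch below shows one way to compute it from a listing formatted like the one above, assuming the blocks of ten bases are separated only by whitespace. `clean_sequence` and `gc_percent` are hypothetical helper names, and the input used here is only the last line of the listing, not the whole gene.

```python
def clean_sequence(listing: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(listing.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence listing, used as a small input.
tail = clean_sequence("ATCATTAAGC ATTAA")
print(len(tail))         # 15
print(gc_percent(tail))  # 20.0
```

Passing the complete listing to the same two functions gives a length and a GC percentage that can be compared with the Gene Length and GC content fields of the record.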
 
Protein sequence
MSIITKKNLI AAGIFTALIA GNMAMAAEVP AGIALAEKQT MIRNNGAEPQ SLDPNKIEGV 
PEANISRDLF EGLLITSTKD GHPIPGVAES WDNKDFKVWT FHLRKDAKWS NGEPVTAQDF
VYSWQRLVDP NTASPYASYP QYGHILNVDE IIDGKKKPSE LGVKAVDDHT LEVTLSEPVP
YFYKLLVNPA MSPVNKAAIE KFGEKWTQPA NIVTNGAYKL KDWVVNERIV MERNTNYWDN
AKTIVDQITY LPISSEVTDV NRYRSGEIDM TYNNLPIELF QKLKKEIPTE VHVDPYLCTY
YYEINNQKAP FNDARVRTAL KLGLDRDIIV NKVKAQGDLP AFGYTPPYAD GAKLTKPEWF
TWTQEKRNEE AKKLLAEAGY TADKPLKFDL LYNTSDLHKK LAIAASSIWK KNLGVDVKLV
NQEWKTFLDT RHQGNYDVAR AGWCADYNEP TSFLNTMLSD SSMNTAHYKS PAFDAIMKES
LKATDEAQRS AQYDKAEQQL GKDSAIVPVY YYVNARLVKP WVGGYTGKDP MDNIYTKDLY
IIKH
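
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the compact amino acid string of the standard genetic code, whose amino acid assignments translation table 11 shares (the two tables differ in their permitted start codons), and translates the first line of the gene sequence. `translate` is a hypothetical helper written for this example, not a library function.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# first position varying slowest and the third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(listing: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon.

    Whitespace between blocks is ignored. Codons containing characters
    other than A, C, G, T raise KeyError.
    """
    seq = "".join(listing.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGTCCATCA TCACAAAAAA AAATCTGATT GCCGCGGGAA TTTTCACTGC GCTAATCGCG"
print(translate(first_line))  # MSIITKKNLIAAGIFTALIA
```

The output matches the first twenty residues of the protein sequence listing. Because the gene is 1635 bp, the last listing line (`ATCATTAAGC ATTAA`) also starts in frame and translates to `IIKH` followed by the TAA stop codon.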