Gene Ent638_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1988 
Symbol 
ID5113404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2161568 
End bp2162839 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content55% 
IMG OID640492176 
Productputative substrate-binding periplasmic transport protein 
Protein accessionYP_001176715 
Protein GI146311641 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.146215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGTC GTACCTTGTT AAAAGCTTTT GCGCTGTCGG CTTCCGTGGC GGCTATGGGA 
ATGAGTTTTG GTGTGCAAGC TGCTGACACC ATCAAAATTG GGATCATGCA TTCGCTCTCG
GGCACGATGG CGATCTCTGA AACGCCCCTT AAAGACGTCG CGCTGATGAC GATCGATGAA
ATCAACGCCA AAGGCGGCGT GCTGGGCAAA AAACTGGAGC CGGTGGTGGT TGACCCCGCA
TCAAACTGGC CGCTGTTTGC TGAAAAAGCC CGTCAGTTAC TCAGCCAGGA CAAAGTGGCG
GCGGTCTTTG GCTGCTGGAC GTCGGTTTCC AGAAAATCGG TGTTGCCAGT CTTTGAAGAG
GTGAACGGGC TGCTGTTCTA CCCGGTGCAG TATGAGGGCG AAGAGATGTC ACCAAACGTC
TTCTACACTG GCGCCGCGCC AAATCAGCAG GCCATCCCGG CGGTGGAATA TCTGATGGGT
GAAGACGGCG GGAGTGCTAA ACGCTTCTTC CTCTTGGGCA CTGACTACGT TTATCCGCGT
ACCACCAACA AAATTCTGCG TGCGTTCCTG CATTCGAAAG GCGTGCAGGA CAAAGACATC
GAAGAGGTTT ACACCCCGTT TGGTCACAGC GATTACCAGA CTATTGTCGC CAATATCAAG
AAATTCTCCG CGGGCGGAAA AACGGCGGTG GTGTCGACCA TCAACGGCGA CTCCAACGTT
CCCTTCTACA AAGAACTGGC AAACCAGGGC GTAAAAGCAA CCGACGTGCC GGTGGTGGCG
TTCTCGGTGG GTGAAGAGGA GCTTCGCGGC ATCGACACTA AACCGCTGGT GGGTAACCTT
GCGGCGTGGA ACTACTTTGA GTCAGTTGAT AACCCAACCA ACCAGGCGTT TGTCGCGGCC
TATAAAGCCT ATGCCAAAGC GCATAACTTG CCGAACGCCG ACACCGTCGT AACGAATGAT
CCGATGGAAG CGACCTACGT GGGGATCCAT ATGTGGGCGC AGGCGGTTGA GAAAGCCGGC
ACCACGGATG TGGATAAAGT GCGTGCCGCC ATGGCCGGTC AATCCTTTAA AGCGCCGTCA
GGCTTTACGC TGACCATGGA TGCAACTAAT CACCATCTGC ATAAGCCGGT GATGATTGGC
GAAACCGAGG GTAACGGCCA GTTCAACGTT GTCTGGCAGA CCGACGCGCC GGTTCGTGCT
CAGCCGTGGA GCCCGTACAT TCCCGGTAAC GATAAAAAGC CAGAACAACC GATGAAAACC
GCCAGCAACT AA
 
Protein sequence
MHRRTLLKAF ALSASVAAMG MSFGVQAADT IKIGIMHSLS GTMAISETPL KDVALMTIDE 
INAKGGVLGK KLEPVVVDPA SNWPLFAEKA RQLLSQDKVA AVFGCWTSVS RKSVLPVFEE
VNGLLFYPVQ YEGEEMSPNV FYTGAAPNQQ AIPAVEYLMG EDGGSAKRFF LLGTDYVYPR
TTNKILRAFL HSKGVQDKDI EEVYTPFGHS DYQTIVANIK KFSAGGKTAV VSTINGDSNV
PFYKELANQG VKATDVPVVA FSVGEEELRG IDTKPLVGNL AAWNYFESVD NPTNQAFVAA
YKAYAKAHNL PNADTVVTND PMEATYVGIH MWAQAVEKAG TTDVDKVRAA MAGQSFKAPS
GFTLTMDATN HHLHKPVMIG ETEGNGQFNV VWQTDAPVRA QPWSPYIPGN DKKPEQPMKT
ASN