Gene Ent638_3707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3707 
Symbol 
ID5112265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4019441 
End bp4020466 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content51% 
IMG OID640493912 
Productextracellular solute-binding protein 
Protein accessionYP_001178415 
Protein GI146313341 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01096] lysine-arginine-ornithine-binding periplasmic protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.371845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CGATGATAGC CAGCCTGGCC GCCGCCGGCA TGTTGTTTGC TGTAGCGGGT 
CAGGCGCATG CAGGCGCAAC TCTGGATGCC GTTAAAAAGA AAGGCTTTGT ACAGTGCGGT
ATCAGCGATG GGTTACCGGG TTTCTCCTAC GCGGATGCTA ACGGGAAATT CTCCGGGATC
GACGTTGACG TGTGCCGAGG CGTTGCAGCG GCTCTCTTCG GTGATGATAC CAAAGTAAAA
TACACCCCAC TCACAGCGAA AGAACGTTTC ACCGCTTTGC AATCTGGCGA AGTTGATGTG
CTCTCGCGTA ATACCACCTG GACCTCGTCT CGTGATGCTG GCATGGGCAT GACGTTTACT
GGCGTCACCT ATTATGACGG TATCGGTTTC CTGACTCACA ATAAAGCAGG CCTGAAGAGT
GCGAAAGAAC TCGACGGTGC GACTGTCTGT ATTCAGGCCG GTACGGATAC CGAGTTGAAC
GTCGCGGATT ATTTCAAAGC GAATAAGATG AAATACACCC CAGTGACGTT TGATCGCTCT
GATGAATCCG CAAAAGCTCT GGAATCAGGC CGTTGCGATA CGCTGGCCTC TGACCAGTCT
CAGCTGTATG CCCTTCGCAT TAAGCTCAGT AATCCTGCGG AGTGGATTGT TCTGCCTGAA
GTTATCTCAA AAGAACCTCT TGGCCCAGTC GTTCGTCGCG GTGATGAAGA GTGGACCTCG
ATTGTTAAGT GGACTCTCTT CGCCATGCTG AATGCTGAAG AAATGGGAAT TAACTCGAAG
AACGTTGATG AGAAAGCAGC AGCTCCATCC ACTCCGGATA TGGCACATCT TCTGGGTAAA
GAAGGTGACT ACGGCAAGGA TCTTAAGCTC GATAATAAAT GGGCTTACAA CATCATTAAA
CACGTTGGCA ACTACGGAGA GATCTTCGCG CGTAACGTGG GATCGGAAAG CCCTCTGAAG
ATCAAACGTG GCCAGAACAA CCTCTGGAAC AACGGCGGCA TCCAGTACGC TCCACCAGTA
CGCTAG
 
Protein sequence
MKKTMIASLA AAGMLFAVAG QAHAGATLDA VKKKGFVQCG ISDGLPGFSY ADANGKFSGI 
DVDVCRGVAA ALFGDDTKVK YTPLTAKERF TALQSGEVDV LSRNTTWTSS RDAGMGMTFT
GVTYYDGIGF LTHNKAGLKS AKELDGATVC IQAGTDTELN VADYFKANKM KYTPVTFDRS
DESAKALESG RCDTLASDQS QLYALRIKLS NPAEWIVLPE VISKEPLGPV VRRGDEEWTS
IVKWTLFAML NAEEMGINSK NVDEKAAAPS TPDMAHLLGK EGDYGKDLKL DNKWAYNIIK
HVGNYGEIFA RNVGSESPLK IKRGQNNLWN NGGIQYAPPV R