Gene Ent638_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4016 
Symbol 
ID5110481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4357292 
End bp4358812 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content53% 
IMG OID640494234 
Productputative ATP-dependent protease 
Protein accessionYP_001178722 
Protein GI146313648 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0606] Predicted ATPase with chaperone activity 
TIGRFAM ID[TIGR00368] Mg chelatase-related protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0359252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGT CTGTTGTTTA TACCCGCGCT GCTTTAGGAG TGCAGGCCCC GCTCATCTCT 
GTTGAGGTTC ATTTGAGTAA CGGTCTCCCA GGCTTAACGC TTGTCGGGCT GCCCGAAACG
ACAGTGAAAG AAGCCCGAGA TCGCGTTCGT AGCGCCATTA TCAATAGTGG TTATACCTTT
CCGGCAAAAA AGATAACCGT CAACCTGGCG CCTGCAGACT TGCCCAAAGA AGGCGGGCGA
TATGATTTAC CTATCGCTAT AGCGCTTCTC ACCGCCTCTG AGCAACTTAA CGCCCCCAGA
CTGAACACGT ACGAGTTTGT GGGCGAACTA GCGCTTACAG GTGCGTTACG TGGCGTTCCA
GGCGCAATCT CCGGGGCGTT AGCCGCCTTA AAGGCGGGAA GAGAGATTAT CGTTGCTGCC
GAGAATGCAT CAGAGGTCAG CCTGATCGAG AAAAAGGGTT GCCTGATTGC CAATCACCTA
CAAGAAGTCT GTGCTTTTCT CGAAGGCCGT CATGAGCTTG CCGAACCCGA GGAAAATCGT
TTAGTAGTAT CAGACTCTCT TGACGACCTG AGCGATATCA TTGGCCAGGA TCAGGGAAAG
CGTGCGCTGG AAATCACCGC GGCCGGTGGT CATAACCTTC TGCTTATCGG GCCGCCAGGG
ACAGGAAAAA CGATGCTGGC AAGTCGGCTG AATGGTCTGC TCCCACCACT AAATAATCAG
GAAGCGCTTG AGAGTGCCGC AATCATAAGT CTGGCGAATT CAATATCATT GAAGAAGCAA
TGGAAGAGAC GACCGTTTCG TTCTCCTCAT CATAGCGCCT CACTCTGTGC GATGGTCGGC
GGCGGCTCTA TTCCTGAGCC CGGTGAAATC TCTTTAGCTC ATAACGGAAT TCTTTTTCTT
GATGAGTTAC CGGAGTTTGA ACGACGGGTT CTGGATGCAT TACGTGAGCC TATCGAGTCC
GGACAGATCC ACATTTCGCG TACACGAGCG AAGATTAGCT ACCCCGCGCA GTTTCAGCTC
ATTGCCGCCA TGAATCCAAG CCCATCAGGG CATTATCAGG GCAATCACAA CCGGAGTACG
CCTGAACAAA CGCTTCGCTA CCTGGGGAGG TTGTCAGGGC CCTTCCTTGA CCGTTTTGAT
TTGTCCCTGG AGATCCCGCT GCCACCACCG GGAGTGCTTA GCCAGACAAA TTCTACGGGT
GAAACCAGCG TGACCGTGCG GGAAAGGGTG ATCATCGCAC AGGAACGGCA GTTGAAGCGC
CAAAATAAGC TTAATGCACG TCTCGATAAC GCCGAGATTC GCCAGCGTTG TCGTTTGTCA
GAGGAGGATT CCCGTTGGCT AGAAGAGACG TTAACGCGGC TTGGGCTTTC AGTGCGGGCC
TGGCAACGCT TATTGAAGGT TGCACGCACT ATTGCAGATT TGGAGAGTTG TGGTGATATC
GAGAGGAGGC ATTTGCAAGA AGCATTAAGC TATCGCGCGA TAGATCGTCT GCTGCTGCAT
CTGCAAAAGA TGCTGACGTA G
 
Protein sequence
MSLSVVYTRA ALGVQAPLIS VEVHLSNGLP GLTLVGLPET TVKEARDRVR SAIINSGYTF 
PAKKITVNLA PADLPKEGGR YDLPIAIALL TASEQLNAPR LNTYEFVGEL ALTGALRGVP
GAISGALAAL KAGREIIVAA ENASEVSLIE KKGCLIANHL QEVCAFLEGR HELAEPEENR
LVVSDSLDDL SDIIGQDQGK RALEITAAGG HNLLLIGPPG TGKTMLASRL NGLLPPLNNQ
EALESAAIIS LANSISLKKQ WKRRPFRSPH HSASLCAMVG GGSIPEPGEI SLAHNGILFL
DELPEFERRV LDALREPIES GQIHISRTRA KISYPAQFQL IAAMNPSPSG HYQGNHNRST
PEQTLRYLGR LSGPFLDRFD LSLEIPLPPP GVLSQTNSTG ETSVTVRERV IIAQERQLKR
QNKLNARLDN AEIRQRCRLS EEDSRWLEET LTRLGLSVRA WQRLLKVART IADLESCGDI
ERRHLQEALS YRAIDRLLLH LQKMLT