Gene Ent638_2033 details

Gene Information

Locus tag: Ent638_2033
Symbol:
ID: 5113449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2206896
End bp: 2208203
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 53%
IMG OID: 640492221
Product: HipA domain-containing protein
Protein accession: YP_001176760
Protein GI: 146311686
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
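
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2206896, 2208203)
print(length)                  # 1308
print(protein_length(length))  # 435
```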


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.420759
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCA GTAAACTGAC CGTTGCGATG AACGGAACCG TTGTCGGCAC GCTGTATCGT 
AACGGCAAAG GGGCGATGTC CTGGCGCTAT GCGCAGACAT GGCTCGACAC GCCGGGCGCG
CGCGTGATTT CGCAGTCGCT ACCGTTAACG CCGCGTCGTC AGGAAGGCGA GGCTGTGTAT
AACTTCTTTA GCAATCTGCT GCCTGATTCG CAGGCGATTA TCTCGCGCAT GCAAACGCGC
TTTAAAATCC CTACTGACCA TCCTTTTGAT TTACTGGCAA GCGTTGGCCG CGACTGTGTA
GGCGCAATCC AGCTCTACCC GGAAAACAGT GAAATACCGC CGGTGACGCA CATGCACGCC
ACGCCGCTTT CGGATAACGA GATTGCGGCC CTGCTGGAAG GGTATCGTAA TGCGCCGTTA
GGGATGACCG ACGACCAGGA CTTCAGGATC TCCATTGCCG GTGCGCAAGA AAAAACGGCG
CTGCTCTGGC ATCAGGATTG CTGGCAACGG CCCACCGGCA GCACGCCAAC CAGTCATATT
TTCAAACTGC CCATCGGCAA AATTGAACAG AACAATATTG ATCTGAGTGA AAGCTGTGAG
AATGAGTGGC TGTGTTTACG TCTTGCGCGT GAGTTCGGTT TTGCGGTAGC GGAGGCCACG
TTAGCGACAT ATGCCGACAA GAAAGTGCTG ATTGTCGAGC GTTTTGATCG TAAATGGTCT
CGAGACGGAA AGTGGCTAAT GCGCCTCCCG CAAGAAGATA TGTGTCAGGC TCTCGGCTAT
TCACCTGCGC TAAAGTATGA ATCTCACGGC GGGCCAGGCA TCGCCGACAT CATGACCTTG
CTGCTGGGAT CGCGCCGTTC AAACCAGGAT CGCGAAACGT TTTTCCGCAC GCAGATATTT
TACTGGCTGA TAGGCGCGAT CGATGGGCAT GCAAAAAATT ACAGTGTGTT TATCGAACCC
GATTCGGCTT ACGTGATGAC GCCGCTTTAC GATATTTTGT CGGCCTATCC CATCTTCGGG
CCAAAGGGTA TTTCTGCGCA AAAAGCGAAA ATGGCCATGG CGCTGCAGGG CAAAAACCGG
CAATACCACT GGGCTCAAAT TCAACCACGT CATTTCCCGG CCACGGCCGA GCGTGTTGGT
TTTTCTGCTA CGCGCGCGAA AAAGATGATG GTTGAAATGG GCGGTATGAC AAATGAGGTT
ATCGAGCGGG TGCGTGAGTC GTTACCGGTT GATTTCCCGA CACATATCAG CGAAGCGATT
TTTAACGGTA TGGCGAAACA AGCGGAAAGG CTAATAGGTT CAGAATAA
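
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the unambiguous bases (A, C, G, T); the database may round or define it differently. The function name is illustrative. The example call uses only the first 20 bases of the gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace (such as the 10-base block separators used in this
    record) and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGAAAACCA GTAAACTGAC"))  # 35.0
```

Passing the full 1308 bp sequence to the same function gives the gene-wide value to compare against the GC content field.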
 
Protein sequence
MKTSKLTVAM NGTVVGTLYR NGKGAMSWRY AQTWLDTPGA RVISQSLPLT PRRQEGEAVY 
NFFSNLLPDS QAIISRMQTR FKIPTDHPFD LLASVGRDCV GAIQLYPENS EIPPVTHMHA
TPLSDNEIAA LLEGYRNAPL GMTDDQDFRI SIAGAQEKTA LLWHQDCWQR PTGSTPTSHI
FKLPIGKIEQ NNIDLSESCE NEWLCLRLAR EFGFAVAEAT LATYADKKVL IVERFDRKWS
RDGKWLMRLP QEDMCQALGY SPALKYESHG GPGIADIMTL LLGSRRSNQD RETFFRTQIF
YWLIGAIDGH AKNYSVFIEP DSAYVMTPLY DILSAYPIFG PKGISAQKAK MAMALQGKNR
QYHWAQIQPR HFPATAERVG FSATRAKKMM VEMGGMTNEV IERVRESLPV DFPTHISEAI
FNGMAKQAER LIGSE
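
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table by hand; table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the code is an assumption about how one might verify the record, not the database's own method.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence up to the first stop codon.

    Whitespace is removed first, so the 10-base blocks of this record
    can be pasted in directly. A trailing partial codon is ignored.
    Codons containing characters other than A/C/G/T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate("ATGAAAACCA GTAAACTGAC C"))  # MKTSKLT
```

Translating the complete gene sequence this way and comparing the result with the protein sequence (with spaces removed) is a quick consistency check on the record.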