Gene Ent638_3476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3476 
Symbol 
ID5112981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3783920 
End bp3785083 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content58% 
IMG OID640493681 
Productphage late control D family protein 
Protein accessionYP_001178186 
Protein GI146313112 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.216601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAACGG ATATGAATAT TCAGGCCGGG GCACGCATCG CGCCTGCGTA TATGCTCACG 
CTCAATGGCG CGGATATCAC ACAGAATTTT AGCGACCGGC TTATCGGGCT GACCATGACC
GACAATCGCG GATTTGAGGC TGACCAGCTC GATATCGAGC TTGATGATAC CGACGGGCTG
GTCGAGTTGC CGCCGCGCGG GGCAAAGCTG ACGCTGTGGT TAGGCTGGCA GGGCTCCGCG
CTGGTGAATA AGGGGAGTTT TACGGTCGAT GAAATCGAGC ACCGGGGCGC GCCCGATACG
CTGACCATCC GGGGGCGCAG TGCGGATTTT CGCGGGACGC TTAACTCTCG CCGCGAGCAG
TCATGGCATG ACACTACGCT CGGGGTGATT GTCGAGACCA TCGCGCAGCG TAACAAACTG
ACGGCCAGCA TGGCGGATAC CCTGAAAGCC ATTGCGATCC CGCATATCGA CCAGGCGCAG
GAATCGGACA CGGCGTTTTT GTCCAGGCTG GCGGAGCGTA ACGGGGCGTC TGTCTCAGTA
AAAGCCGGGA AATTATTATT CCTGAAAGCG GGTAGCGCGA TGACGGCCAG CGGCAAACCC
ATCCCGCAAA TGACCGTCGA GCGCGGTGAC GGCGACCGGC ATCAGTTCGC CATTGCTGAC
CGGGAGGCGT ACACCGGCGT CACGGCGAAA TGGCTGCACA CGAAAGACCC GAAACCGCTA
AAGCAAAAGG TGAAGCTGAA ACGAAAGCCA AAGGTGCAGC ACCTGCGCGC GCTACAGCAT
CCGAAAGCGG CTAAAACCAC GGCAAAGGCC AAAGCCAAAA AGGAGCAGGA AGCGCGCGAG
GGTGAGTATA TGGTCGGTGA GGCTGACAAC GTGCTGGAGC TCACGACCAT CTACGCCACA
AAGGCGCAGG CCATGCGCGC TGCTCAGGCG AAGTGGGACA AAATACAGCG CGGAGTGGCG
GAGTTTTCAA TCTCGCTGGC GTATGGCCGT GCTGATTTAT TTCCTGAAAC GCCGGTTGCG
GTGAAGGGCT TTAAGCGCGT GATAGACGAG CAGGCGTGGA TAATCAGCCG GGTGGTGCAT
AACCTCAACG GGAACGGCTA CACGACGGGC TTAGAGCTCG AGGTGAAGCT TTCGGATGTT
GAATATGTAG CGGAGGAGGA TTAA
 
Protein sequence
MITDMNIQAG ARIAPAYMLT LNGADITQNF SDRLIGLTMT DNRGFEADQL DIELDDTDGL 
VELPPRGAKL TLWLGWQGSA LVNKGSFTVD EIEHRGAPDT LTIRGRSADF RGTLNSRREQ
SWHDTTLGVI VETIAQRNKL TASMADTLKA IAIPHIDQAQ ESDTAFLSRL AERNGASVSV
KAGKLLFLKA GSAMTASGKP IPQMTVERGD GDRHQFAIAD REAYTGVTAK WLHTKDPKPL
KQKVKLKRKP KVQHLRALQH PKAAKTTAKA KAKKEQEARE GEYMVGEADN VLELTTIYAT
KAQAMRAAQA KWDKIQRGVA EFSISLAYGR ADLFPETPVA VKGFKRVIDE QAWIISRVVH
NLNGNGYTTG LELEVKLSDV EYVAEED