Gene Ent638_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3478 
Symbol 
ID5112983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3785559 
End bp3787988 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content59% 
IMG OID640493683 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001178188 
Protein GI146313114 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.790573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.139413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATA ACGTCAGACT TGAGGTGCTG CTGAACGCAG TCGACCGGGC AAGCCGACCA 
CTTAAAGCTA TCCAGACCGC CAGCAAATCC CTGTCGGGCG ATATCCGCAC TTCACAGAAA
TCCCTGCGCG AGCTCAATGC GCAGGCGTCA CGCATTGACG GATTTCGTAA AGCCAGCGCG
CAGCTTGCCG TGACCGGCCA GTCGCTTGAG AAAGCAAAGC TTGAAGCGCA AGCCCTTGCC
ACGCAGTTTA AAAATACCGA GCGCCCGACG CGCGCACAGG CGCAGGTGCT TGAATCCGCG
AAGCGTGCCG CCGAGGGGTT GCAGACCAAA TATAACAGCC TGACAGAGTC CATTAAGCGC
CAGCAACGCG AGCTCGGGGC AGCGGGGATT AATACGCGCA ATCTGGCGAA TGATGAGCGG
GGGCTAAAAT CGCGCATCAG CGAGACAACC GCGCAGCTTA ACCGCCAGCG TGAGGCACTG
GCGAAAGTCA GCGCTCAACA GGCCAGACTT AGCCAGGTAA AAGACCGATA TCAGGCCGGT
AAATCTCTTG CGGGAAGCAT GGCGGGAGCG GGGGCTGCCG GGGTCGGTAT TGCGACAACG
GGAACCGTGG CCGGGGTAAA ACTGATGATG CCGGGCTTTG ACTTTGCGCA GAAAAATTCC
GAGCTGCAGG CTGTGCTCGG CGTCGAAAAA CAGTCGCCCG AAATGCAGGC GCTGCGTAAA
CAGGCGCGAC AGCTCGGTGA CAACACCGCC GCCTCTGCTG ACGATGCGGC CGGTGCGCAA
ATTATCATTG CCAAAAGCGG TGGCGATGCG GCGGCCATTC AGGCGGCGAC GCCGGTCACG
CTGAATATGG CGCTGTCCAA CAAGCGCACG ATGGAGGAGA ACGCCGCGCT GCTGACGGGA
ATGAAATCCG CGTTTCAGCT TTCCAACGAC AAAGTCGCGC ATATTGGCGA TGTGCTGTCG
ATGACGATGA ACAAAACCGC TGCTGACTTT GACGGGATGA GCGACGCGCT GACCTATGCC
GCGCCGGTCG CAAAAAATGC CGGGGTGAGT ATCGAAGAAA CTGCCGCGAT GGTCGGTGCG
CTGCACGATT CTAAAATCAC CGGCTCGATG GCGGGAACGG GGAGCCGTGC AGTGCTGAGT
CGCCTGCAGG CTCCGACCGG TAAAGCGTAT GACGCAATCA AAGAGCTCGG GATTAAAACG
TCTGACAGTA AGGGAAACAC GCGCCCGATA TTTTCCATCC TGAAAGAAAT GCAGCGCAGT
TTTGAGAAAA ACAACCTCGG CACTGGCCAG AAAGCCGAAT ACATGAAAAC CATTTTCGGA
GAGGAGGCCA GCTCGGCGGC AGCGGTGCTG ATGGCCGCAG CCTCAAGCGG CAAGCTTGAC
CAGCTCACCG CTGCGTTTAA AGCCTCGGAC GGCAAAACCG AGGAGCTGGT TAAGGTCATG
CAGGACAACC TCGGCGGCGA CTTTAAAGAA TTTCAGTCAG CCTATGAGGC GGTCGGGACT
GACCTGTTTG ACCAGCAAGA GGGCTCACTG CGTAAGCTGA CACAGACGGC GACGCAGTAT
GTTTTAAAAA TTGACGGCTG GATTACCAAA AACAAAGGAC TGGCGACCAC TATCGGCGTG
GTGGTGGGGG GAGCGCTAGC GCTCATTGGC GTGATGGGCG GGATTGGCCT TGTCGCGTGG
CCGGTGGTGA TGGGGATTAA TGCCATTATC GCCGCTGCTG GCGTGCTGGG TGTGGTATTC
AGCTCGGTCG GCACTGCCAT TGGTGCAATC AGTCTGCCGG TGGTGGCCGT GGTCGCGGCT
GTGGTGGCGG GTGCGCTGCT CATTCGCAAA TACTGGGAGC CGATTAGTGC CTTTTTCTCG
GGCGTGGTGG AGGGGCTTAA AGCCGCTTTC GCGCCGGTCG GAGAAATGTT TGCACCGCTC
GCGCCGGTGT TTGACTCCAT CGCGGAAAAG CTCGGTGTGG TCTGGAAATG GTTTACTGAC
CTGCTTGCGC CGGTGAAAGC CACGCAGGAG ACGCTCGACC GCTGCAAAAA TGTCGGCGTG
GCCTTTGGTC AGGCGCTGGC TGATGCGCTG ATGGCTCCGC TCAACATCTT TAACAGTCTG
AGCGGAAAGG TGAGCTGGTT GCTGGAAAAA CTCGGGGTTA TCAAAAAGGA ATCCAGCGAC
CTCGACCAGA ACGCCGCGAA AACGGACAAG ACCGCCGCAA ATGGCGGGTA TATCCCGGCA
ACAGCGGCCT ATGGCGGCTA TCAGAGTTAT CAACCTGTCA CGGCTCCCGC AGGGCGCTCG
TATATCGACC AGAGCAAAAG CGAGTACAAC ATCACCCTGC AGGGCGGGGT CGCGGCGGGG
AGTGACCTCG ACCGCCAGCT CCGCGACGCC GTCGACAAGC TCGACCGCGA AAAACGTGCG
CGCCAGCGCT CCAGCATGAG ACACGATTGA
 
Protein sequence
MSNNVRLEVL LNAVDRASRP LKAIQTASKS LSGDIRTSQK SLRELNAQAS RIDGFRKASA 
QLAVTGQSLE KAKLEAQALA TQFKNTERPT RAQAQVLESA KRAAEGLQTK YNSLTESIKR
QQRELGAAGI NTRNLANDER GLKSRISETT AQLNRQREAL AKVSAQQARL SQVKDRYQAG
KSLAGSMAGA GAAGVGIATT GTVAGVKLMM PGFDFAQKNS ELQAVLGVEK QSPEMQALRK
QARQLGDNTA ASADDAAGAQ IIIAKSGGDA AAIQAATPVT LNMALSNKRT MEENAALLTG
MKSAFQLSND KVAHIGDVLS MTMNKTAADF DGMSDALTYA APVAKNAGVS IEETAAMVGA
LHDSKITGSM AGTGSRAVLS RLQAPTGKAY DAIKELGIKT SDSKGNTRPI FSILKEMQRS
FEKNNLGTGQ KAEYMKTIFG EEASSAAAVL MAAASSGKLD QLTAAFKASD GKTEELVKVM
QDNLGGDFKE FQSAYEAVGT DLFDQQEGSL RKLTQTATQY VLKIDGWITK NKGLATTIGV
VVGGALALIG VMGGIGLVAW PVVMGINAII AAAGVLGVVF SSVGTAIGAI SLPVVAVVAA
VVAGALLIRK YWEPISAFFS GVVEGLKAAF APVGEMFAPL APVFDSIAEK LGVVWKWFTD
LLAPVKATQE TLDRCKNVGV AFGQALADAL MAPLNIFNSL SGKVSWLLEK LGVIKKESSD
LDQNAAKTDK TAANGGYIPA TAAYGGYQSY QPVTAPAGRS YIDQSKSEYN ITLQGGVAAG
SDLDRQLRDA VDKLDREKRA RQRSSMRHD