Gene Xaut_4487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_4487 
Symbol 
ID5421911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp4979898 
End bp4981898 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content65% 
IMG OID640883751 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001419365 
Protein GI154248407 
COG category[S] Function unknown 
COG ID[COG5280] Phage-related minor tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.108183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0686634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGCA AAACCCTCGA CGTCTCGATC CTCGTGCGCC TGGTGGACCG TGTCACCGGC 
CCGCTCTCAG TGCTCCAGCG CAAGTTCGCC GGTCTCGCCC AGCTGAGCCA GCGCATCGGC
ATCCTCGGCG CCGCCGTCGC CGGTATCTCG TTTGCCGTGC CGCTGGCGTC GGCTGCAGCC
TATGACCAAC AGCTGCGCGA CATGGCGGTG ACCGCCGGCG AGTTCGGCCA GGGCGCGGAG
CGTATGATCC GCAAGGTCGG TCGCGATATG GAGCAGTTGG CGCTCAAGAC AGGCATCGCC
TCCAAGGCGC TTGCAGATGC ACGGGGTGTC CTGGTGTCCG GCGGCTTCGA CAATGGCCTC
GTCACCGATC TCATGCCCAC CATCGGCAGG GTGTCCAAGG CGGCGAGCGC CGACCCCATC
GATACCGCCA AGACTGCCGG TGCTCTCGCT GGCCCCCTCA AGATCGCCGC AGCTGACATG
GAGCAATCGC TGGCGATGCT CGTGGTCGCC GGCAAGCTGG GGGCGTTCGA GTTCAAAAAT
ATGGCGAAGG AGCTGCCGGG GCTGGCGGCT CAGATGAACA ATCTCGGCGT GACCGGCAAG
GAGGCTGTAG CCTCCCTTGG TGCAGGGCTT CAGATTGCGA TGAAGGCGAC GGATTCGCCG
GCAACGGCCG CGAACAACAT GAAGAATTTT CTCGCCAAGC TGAACGCGCC TGAGACCGTG
AAAAACTTTG AGGAGGCCGG CGTCAACCTT CCCGCCGTTA AGGCAGACGC CGTGGCCAAG
GGGATCAACC CGGTCGAAGC GGTCATTCAG AAGCTGATGG ACCTGACCAA GGTGCCGCAG
AAGGAGATCG ATGCGATCTA CAAGCGGTCG AAGGATGCCG GGAAAACCGA TGTCGAGGCG
GCAGAGGATG TGAAGTCGCG GATCGAGCAG GCGCTGGCAG GATCGAAGGT CGGCAAGCTC
TTCGGCGACA TGCAGGCGCT GGATTTCATT ACACCGATGC GCATCTACAC CCAGCTCTAC
AAACAATATA CGGAGGCCAT CAAGGCGGCC GACGTTGGTG TGATCGGTCG AGATTCGGAA
ACCCAGCTCG CCGGCCTCGA CAGCGCCCTG AAGGAAACAG CCGAAATCTC TGAACAGGCC
GGCCGCCGGA TCGGGGACGG CTTCGCTCCC GTCCTGTTGC AGGTCAACAA GGGCCTCAAG
GCCGCACTGG GATGGATGCG GGAGGTGGAT GAGGCGTATC CCAGCCTGAT CGATTACACC
CTCCTGGGTG TCGGCGCGTT CCTCGCCCTG GTGGTCGCGC TCGCCGCGCT GGGGCCAGCG
TTCGCCATCA TCTCCGCTGG CTTTGGGGTT CTTCTGCTGC TCTGGTCGCC CATTGGCGCC
GCCATCATGG GCATCGCGGC GCTCGCGGTG CTGGTTTGGG CCAACTGGTC GACGGTGGGG
CCGATGTTCG CCCGCATGTG GGAGGGCATC AAGACGGTGT TCTCCGGCTT CGTCGAGTGG
GTCGCCGGCA TCTTCACCCT CGATTCCAAG CGAGCGGCCG ATGGCGTGCG CACTATGTTT
GAGGGTCTCG CCACCATCGC CGGTGGCCTG TGGGACCTCG TCAAGGTGTC CTGGACGGGC
TTCGTGGCGT GGATCGATGG CTGGACGGCC GGTGCCTTCA CCGGCGCGAT CAACGGCATC
AAGGCCGCGT GGCAAGGCCT GATCGACTGG TTCAAGTCGC ACGTGCCCAG CTTCGACATC
CAGATGCCAG ATTGGGTCAA GCGCCTCTAT GGCGGGCAGA TGGCAACCCT GCCGGCGGTC
GACGCACCCG TCAATCCAAT GGGCGACGGC ACCGGCTGGT CGGCGCCGGG CTTCGCGGCG
CCGACTGCGG CGGCCGGCAC CACCAAGGTC GGCGGCGAGA TCATCGTGCG CGCCGAGCCC
GGCACCGAGG CGCGGGTGCA GTCGGACAAC CCCGCGGTGC CCATGGTGCA GGATCGCGGC
CTCGTCCTCG GTCGGCCGTG A
 
Protein sequence
MSGKTLDVSI LVRLVDRVTG PLSVLQRKFA GLAQLSQRIG ILGAAVAGIS FAVPLASAAA 
YDQQLRDMAV TAGEFGQGAE RMIRKVGRDM EQLALKTGIA SKALADARGV LVSGGFDNGL
VTDLMPTIGR VSKAASADPI DTAKTAGALA GPLKIAAADM EQSLAMLVVA GKLGAFEFKN
MAKELPGLAA QMNNLGVTGK EAVASLGAGL QIAMKATDSP ATAANNMKNF LAKLNAPETV
KNFEEAGVNL PAVKADAVAK GINPVEAVIQ KLMDLTKVPQ KEIDAIYKRS KDAGKTDVEA
AEDVKSRIEQ ALAGSKVGKL FGDMQALDFI TPMRIYTQLY KQYTEAIKAA DVGVIGRDSE
TQLAGLDSAL KETAEISEQA GRRIGDGFAP VLLQVNKGLK AALGWMREVD EAYPSLIDYT
LLGVGAFLAL VVALAALGPA FAIISAGFGV LLLLWSPIGA AIMGIAALAV LVWANWSTVG
PMFARMWEGI KTVFSGFVEW VAGIFTLDSK RAADGVRTMF EGLATIAGGL WDLVKVSWTG
FVAWIDGWTA GAFTGAINGI KAAWQGLIDW FKSHVPSFDI QMPDWVKRLY GGQMATLPAV
DAPVNPMGDG TGWSAPGFAA PTAAAGTTKV GGEIIVRAEP GTEARVQSDN PAVPMVQDRG
LVLGRP