Gene Xaut_3708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_3708 
Symbol 
ID5424197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp4113428 
End bp4114543 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content67% 
IMG OID640882964 
ProductHK97 family phage prohead protease 
Protein accessionYP_001418591 
Protein GI154247633 
COG category[R] General function prediction only 
COG ID[COG3740] Phage head maturation protease 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGACC CCCGGACGCT CTATGTCAGC CGGCCCCTGC TGAACGGCGC GGAACTGATC 
GATTGGGCGA AGGGGCAGGG GTTTACCAAG ACGGTCCCCG CCGACGATCT GCATGTGACC
ATCGCCTACA GCCGCGATCC CGTGGACTGG GCGGCAGCCG GTGACCACTT CGACCAGGTG
CGGGCCCGGG CTGACGGGCC GCGTTCAGTG GAGCAACTCG GCGACAAGGG CGCGGTCGTG
CTGCGGTTCG AGCATGCTGA ACTCGCCCAG CGCTGGCAGG CCTTCCGCGA CATTGGGGCG
TCCTGGGACC ACGACGGGTA CCGCCCGCAC GTGACGATCA CCTACGACGC GGCGGGTGTC
GACCTGAGCA AGGTCCAGCC GTTCAGCGGC GAGCTGGTCT TCGGACCCGA AGAGTTCGCG
GAGATCGATG AGGATCGGGC CGACCGGGTC CGGGCGAGTG AGAAGGGGCG GCGCGCCATG
GAGATCAAGA GCTTCGCTCT GGAACTGAAG GAGGTTGGCG ACGCCGGCAC CTTCACCGGC
TACGGGGCAG CGTTCGGCAA CGTCGACCAG GGGCGAGACC TCATCGCGCG CGGAGCCTTC
GCGGATAGCC TGTCGGCGTG GCGTTCGAAG GGCAAGTTGC CCAAGCTCCT GTGGCAGCAC
GACGCGCGCA AGCCGATCGG CGTGTGGACC GAGATGCGCG AGGACGACTA CGGGCTCTTC
GTGAAGGGCC GGTTCACCGC CGGCGTGAAG CAGGCGGATG AGGCGTACGC GCTGCTCAAG
GATGGTGCTC TGGATGGCCT TTCCATCGGC TACGCCACCA TCGAGGACGA GATCGACCGG
GCCGCAGGGA TCCGGAAGCT GGTCAAGCTC GACCTGATGG AGGTCAGCCT GGTCACCTTC
GCGATGAACC CGGCCGCCGG CGTCACCGGC GTGAAGGCGG GCCCGCCGCG CACCATTCGA
GAATTCGAGG CCGGGCTTCG GGAGAAGTTC GGTTTCTCGC ACGCCCAGGC GAAGTCGATC
GCTTCGTCCG GGTTCAAGTC GCTGGAGCCT CGGGATGAGG ACGGCGCGAT GAACGACCTG
CTGCGGACCA TCAAGGGCAT CCGGGCCGGT CTCTGA
 
Protein sequence
MGDPRTLYVS RPLLNGAELI DWAKGQGFTK TVPADDLHVT IAYSRDPVDW AAAGDHFDQV 
RARADGPRSV EQLGDKGAVV LRFEHAELAQ RWQAFRDIGA SWDHDGYRPH VTITYDAAGV
DLSKVQPFSG ELVFGPEEFA EIDEDRADRV RASEKGRRAM EIKSFALELK EVGDAGTFTG
YGAAFGNVDQ GRDLIARGAF ADSLSAWRSK GKLPKLLWQH DARKPIGVWT EMREDDYGLF
VKGRFTAGVK QADEAYALLK DGALDGLSIG YATIEDEIDR AAGIRKLVKL DLMEVSLVTF
AMNPAAGVTG VKAGPPRTIR EFEAGLREKF GFSHAQAKSI ASSGFKSLEP RDEDGAMNDL
LRTIKGIRAG L