Gene Xaut_3155 details


Gene Information

Locus tag: Xaut_3155
Symbol:
ID: 5425004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 3502522
End bp: 3503808
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 72%
IMG OID: 640882401
Product: HK97 family phage major capsid protein
Protein accession: YP_001418042
Protein GI: 154247084
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
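
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3502522, 3503808)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Both values match the Gene Length and Protein Length fields in the record.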


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0231727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0147808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCATC CCGACACCGC CCGCACCATC GCTGGCGCGC CTGAGACCAA GGTGGCCGGC 
GCCGAGGCCA CTTCCGCCGA GGCCAGCTCC GCCGTGGCCG ATTTCCTTGC CGCCTTCGAG
ACCTACAAGC AGGTGAACGA CACCCGCCTG GCGCAGATGG AGCGGCGCAG CGCCGACGTG
CTCACCACCG AGCAGCTGGC CCGTATCGAT GCCGCCCTCG ATACCCACAA GGCGCGGCTC
GACGCCCTCG CCACCAAGGC GCGCCGGCCC GCGCTCGGCG CCGCGCCGGA GCGCACCGAG
GCGCCCGCCG CCACCCGCGA GCACACCGAT GCCTTCGCCA CCTATGTGCG CCACGGCGAG
GCCGGCGGCC TGAAGGCGCT GGAGGCGAAG GCCCTGTCCT CCGCGTCCGG CGATGCGGGC
GGCTACCTCG TGCCCTCGGA GACCGAGACC GAGATCGGCC GGCGCCTTGC GGTGCTCTCG
CCCATCCGCG CGCTGGCCTC GGTGCGGACC ATCGGCGGCG GCACCTATCG CAAGCCGTTC
ATGACCTCCG GCCCGGTCTC CGGCTGGGCG GCGGAGACGG CGGCCCGGCC GGAAACCGCG
AGCCCGGTGC TGGCGGAACT GGCCTTCCCG GCCATGGAGC TCTACGCCAT GCCCGCCGCC
ACCCAGTCGC TGCTGGACGA CGCGCAGGTG AATGTGGAGG AGTGGCTCGC CACCGAGGTG
GACACCGCCT TCGCGACCCA GGAGGGGGTG GCCTTCGTCA CCGGCGATGG CGTTGCCAAG
CCCAAGGGCT TCCTCGCCTA CACCAAGGTG GCCGAGAGCG CCTGGGCCTG GGACAAGGTG
GGCTATGTGG CCACCGGGGC TGCGGGCGCC TTCCCGTCCG CGACGCCGGC CGATCCGCTG
CTGGATCTGG TCTATTCGCT GAAGGCCGGC TACCGGCAGA ACGCCACCTT CGTCATGAAC
CGGCAGACGC AGGGCGCCGT GCGCAAGCTG AAGGACGAGA ACGGCAATTA CCTGTGGGCG
CCGCCCGCCG GGGTGGGCCA GGCCGCGAGC CTGATGGGCT TCCCGGTGGT GGAGAGCGAG
GCCATGCCGG ATGTGGCGGC CGACGCCTAT GCCATCGCCT TCGGCGACTT CCGCCGCTTC
TACCTGGTGG TGGACCGCGC CGGGGTGCGG GTGCTGCGCG ATCCCTATTC GGCCAAGCCC
TACGTGCTGT TCTACACCAC CAAGCGCGTG GGCGGCGGGG TGCAGGACTT CGACGCCGCC
AAGCTGCTGA AGTTCGCCGC GAGCTGA
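
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, applied only to the first displayed line as sample input. The helper names are hypothetical, and how the database derives its reported GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used here as sample input.
first_line = "ATGTCCCATC CCGACACCGC CCGCACCATC GCTGGCGCGC CTGAGACCAA GGTGGCCGGC "
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full 1287 bp sequence through the same two functions would allow a comparison with the GC content field in the record.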
 
Protein sequence
MSHPDTARTI AGAPETKVAG AEATSAEASS AVADFLAAFE TYKQVNDTRL AQMERRSADV 
LTTEQLARID AALDTHKARL DALATKARRP ALGAAPERTE APAATREHTD AFATYVRHGE
AGGLKALEAK ALSSASGDAG GYLVPSETET EIGRRLAVLS PIRALASVRT IGGGTYRKPF
MTSGPVSGWA AETAARPETA SPVLAELAFP AMELYAMPAA TQSLLDDAQV NVEEWLATEV
DTAFATQEGV AFVTGDGVAK PKGFLAYTKV AESAWAWDKV GYVATGAAGA FPSATPADPL
LDLVYSLKAG YRQNATFVMN RQTQGAVRKL KDENGNYLWA PPAGVGQAAS LMGFPVVESE
AMPDVAADAY AIAFGDFRRF YLVVDRAGVR VLRDPYSAKP YVLFYTTKRV GGGVQDFDAA
KLLKFAAS
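
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits, and this gene starts with ATG). The `translate` helper is hypothetical and written only for illustration; it is applied to the first 30 bases and the last 27 bases of the gene sequence shown above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCCCATC CCGACACCGC CCGCACCATC"))  # MSHPDTARTI
print(translate("AAGCTGCTGA AGTTCGCCGC GAGCTGA"))     # KLLKFAAS
```

The two outputs match the first ten and last eight residues of the protein sequence, and the final TGA codon accounts for the stop.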