Gene Jann_1635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1635 
Symbol 
ID3934083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1622120 
End bp1623301 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content61% 
IMG OID637903986 
Productphage major capsid protein, HK97 
Protein accessionYP_509577 
Protein GI89054126 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.304591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.904057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA CCCAGGACCG GGGCGCGAGC AGCGTACCCA TGGCTGAAGT CAAGACCGCG 
ATCGGCGGAT TCTTGAGTGA GTTCAACCAA TTCCAAGACG ACATGAATGC AAAATTTCAA
AAGCAGGAAG ACCGTATCGC TATGCTCAAC ACGAAAACAA ACACGTTTCA ACGCCCCGCA
TTGTCGGCGG ATGTCGACAC CACCGCCCCC CACAAGCTGG CGCTGAAATC CTATCTTCGC
TGTGGCGATG ATGATGCGCT TCGCGGCCTG GAACTGGAAG GCAAGGCGAT GAACACCGCC
GTCAATGCCG AAGGTGGCTA TCTGGTCGAT CCGCAGACGT CCGAGATGAT CCAGTCGGTC
CTACGCTCCT CGTCCTCCCT GCGCTCGGTC GCCAATGTGG TGACGGTTGA AGCGACGTCC
TTTGACGTGC TGATCGACAG CACCGATACC GGCGCCGGTT GGGCCGATGA GGTGTCCAAC
ACGACCGAGA CCGACACGCC CACCATTGAG CGCATCTCCA TCGCGCTGCA CGAGCTGTCG
GCGCTGCCCA AAGCGTCCCA GCGTCTTCTG GACGACGCGG CATTTGACAT CGAAGGCTGG
CTGGCCGGTC GCATCGCCGA CAAGTTCGCC CGCGCCGAGG CCGCGTCCTT CATCACCGGT
GACGGCTCTG GCAAGCCCAC GGGCATGCTC ACCCACCCCA CCGTGGACAA CGACAGCTGG
TCTTGGGGCA ATCTGGGCTA TGTCGCGACT GGCACCGCGG GCGATTTCGA CAATACCAAC
GCCGCCGATG CAATTGTCGA TCTGGTCTAT GCACTGGGTG CACGCTACCG CGCCAACGCC
AACTTCATCA TGAACTCCAA GACCGCCGGT GCGGTGCGGA AGATGAAAGA CGCGGACGGC
CGCTTCCTGT GGTCCGATGG TCTGGCCGCC GGTGAGCCCG CGCGTCTTAT GGGCTACCCC
GTTCTGATCG CCGAGGATAT GCCCGACATC GCAACCGATG CGATGGCGAT TGCCTTCGGT
GATTTCGGCG CCGGCTACAC CATCGCCGAG CGTCCGGATC TGCGTGTGTT GCGTGACCCG
TTCTCGGCCA AGCCGCATGT CCTGTTCTAT GCCACGAAGC GTGTTGGCGG TGACGTCACC
GATTTCGCAG CGATCAAACT GATGAAGTTC GGCACGTCCT GA
 
Protein sequence
MTETQDRGAS SVPMAEVKTA IGGFLSEFNQ FQDDMNAKFQ KQEDRIAMLN TKTNTFQRPA 
LSADVDTTAP HKLALKSYLR CGDDDALRGL ELEGKAMNTA VNAEGGYLVD PQTSEMIQSV
LRSSSSLRSV ANVVTVEATS FDVLIDSTDT GAGWADEVSN TTETDTPTIE RISIALHELS
ALPKASQRLL DDAAFDIEGW LAGRIADKFA RAEAASFITG DGSGKPTGML THPTVDNDSW
SWGNLGYVAT GTAGDFDNTN AADAIVDLVY ALGARYRANA NFIMNSKTAG AVRKMKDADG
RFLWSDGLAA GEPARLMGYP VLIAEDMPDI ATDAMAIAFG DFGAGYTIAE RPDLRVLRDP
FSAKPHVLFY ATKRVGGDVT DFAAIKLMKF GTS