Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_1635 |
Symbol | |
ID | 3934083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 1622120 |
End bp | 1623301 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637903986 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_509577 |
Protein GI | 89054126 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.304591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.904057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA CCCAGGACCG GGGCGCGAGC AGCGTACCCA TGGCTGAAGT CAAGACCGCG ATCGGCGGAT TCTTGAGTGA GTTCAACCAA TTCCAAGACG ACATGAATGC AAAATTTCAA AAGCAGGAAG ACCGTATCGC TATGCTCAAC ACGAAAACAA ACACGTTTCA ACGCCCCGCA TTGTCGGCGG ATGTCGACAC CACCGCCCCC CACAAGCTGG CGCTGAAATC CTATCTTCGC TGTGGCGATG ATGATGCGCT TCGCGGCCTG GAACTGGAAG GCAAGGCGAT GAACACCGCC GTCAATGCCG AAGGTGGCTA TCTGGTCGAT CCGCAGACGT CCGAGATGAT CCAGTCGGTC CTACGCTCCT CGTCCTCCCT GCGCTCGGTC GCCAATGTGG TGACGGTTGA AGCGACGTCC TTTGACGTGC TGATCGACAG CACCGATACC GGCGCCGGTT GGGCCGATGA GGTGTCCAAC ACGACCGAGA CCGACACGCC CACCATTGAG CGCATCTCCA TCGCGCTGCA CGAGCTGTCG GCGCTGCCCA AAGCGTCCCA GCGTCTTCTG GACGACGCGG CATTTGACAT CGAAGGCTGG CTGGCCGGTC GCATCGCCGA CAAGTTCGCC CGCGCCGAGG CCGCGTCCTT CATCACCGGT GACGGCTCTG GCAAGCCCAC GGGCATGCTC ACCCACCCCA CCGTGGACAA CGACAGCTGG TCTTGGGGCA ATCTGGGCTA TGTCGCGACT GGCACCGCGG GCGATTTCGA CAATACCAAC GCCGCCGATG CAATTGTCGA TCTGGTCTAT GCACTGGGTG CACGCTACCG CGCCAACGCC AACTTCATCA TGAACTCCAA GACCGCCGGT GCGGTGCGGA AGATGAAAGA CGCGGACGGC CGCTTCCTGT GGTCCGATGG TCTGGCCGCC GGTGAGCCCG CGCGTCTTAT GGGCTACCCC GTTCTGATCG CCGAGGATAT GCCCGACATC GCAACCGATG CGATGGCGAT TGCCTTCGGT GATTTCGGCG CCGGCTACAC CATCGCCGAG CGTCCGGATC TGCGTGTGTT GCGTGACCCG TTCTCGGCCA AGCCGCATGT CCTGTTCTAT GCCACGAAGC GTGTTGGCGG TGACGTCACC GATTTCGCAG CGATCAAACT GATGAAGTTC GGCACGTCCT GA
|
Protein sequence | MTETQDRGAS SVPMAEVKTA IGGFLSEFNQ FQDDMNAKFQ KQEDRIAMLN TKTNTFQRPA LSADVDTTAP HKLALKSYLR CGDDDALRGL ELEGKAMNTA VNAEGGYLVD PQTSEMIQSV LRSSSSLRSV ANVVTVEATS FDVLIDSTDT GAGWADEVSN TTETDTPTIE RISIALHELS ALPKASQRLL DDAAFDIEGW LAGRIADKFA RAEAASFITG DGSGKPTGML THPTVDNDSW SWGNLGYVAT GTAGDFDNTN AADAIVDLVY ALGARYRANA NFIMNSKTAG AVRKMKDADG RFLWSDGLAA GEPARLMGYP VLIAEDMPDI ATDAMAIAFG DFGAGYTIAE RPDLRVLRDP FSAKPHVLFY ATKRVGGDVT DFAAIKLMKF GTS
|
| |