Gene Information |
Locus tag | Saro_3122 |
Symbol | |
ID | 3918164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3334561 |
End bp | 3335703 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640445906 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_498391 |
Protein GI | 87201134 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
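The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the stop codon. Both are assumptions about the record's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3334561
end_bp = 3335703
reported_gene_length = 1143      # bp
reported_protein_length = 380    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons;
# assumption: the final codon is a stop and is not counted in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1143 0 380
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # prints: True True
```

Under those assumptions, the reported gene length (1143 bp) and protein length (380 aa) agree with the start and end coordinates.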
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCG AAATCGAAAC CAAGGCCGAT CCGCTGGCCG GATCGTTCGA CATCGTCGAG CGCCAGGACG CGCACGACGC CGAACTGGCG GCGCTTCGTT CAGAACTGAC CGAAGTGAAG GGACGGCTGG AGAAGGCGGG CCGCCTTGCG CTTCGCGCAC CGCTGGCGGG CGGCGAACCG GGCGGCGCGG CCGATGCCGG ATTTGTCGAC GGCTATATCC GCCACGGTCG CGAGACCGAG CTGAAATCGC TTTCCGGCGC GGTGGCGGTG GATGGCGGCT ATGCCATTCC GCAGAAGATC GACGAGATGA TCGCCCGCCG CATGGTCGAG ATCAGCCCGA TCCGTTCGAT CGCCAATGTG GTCCGCACCG GCACTTCGGG CTTCCGCCGT CTCGTCTCGA CCGGCGGCAC CGCATCGGGC TGGGTCAGCG AGACCGGCGC CCGTCCCGAA ACGGCAAGCC CCAAGCTGGC CGAAATCGCC CCGCCTTCGG GCGAGCTTTA CGCCAATCCT TCGGCCACGC AGGCAATGCT CGAGGATGCA GCCTTCGACG TGGAAAGCTG GCTGGCCGGC GAGATCGCGA CCGAGTTCGC ACGGGCCGAA GGCGCCGCCT TCGTCAGCGG GACCGGCACG AACCAGCCCA AGGGCTTCCT TGCCGCAACG ACGAGCGCCG CAGGCGACAG CAGCCGCGCC TTCGGCTCGC TCCAGTTCAT CGGTTCGGGC AACGCGTCGG GCTTCGACAC CACGCCGGAG GCCAAGCTCA TCGACCTCGT GTGCCAGCTC AAGGCACCGC TGCGCCAGGG TGCAGCATGG GTGATGAATT CGACCACGCT GGCCGCCGTG CGCAAGCTCA AGACCGCGGA CGGAGCGTTC CTTTGGCAGC CGGGCATGGT CGAAGGACAG CCGGACCGCC TGCTCGGCTA TCCTGTGGTC GAGGCCGAGG ACATGCCCGA TGTCGCCGCC AACCAGTTCC CGATCGCCTT CGGCAACTTC CGCGCGGGCT ATCTCGTCAC CGATCGCCGC CAGACCACGA TCCTGCGCGA TCCCTATACC AACAAGCCCT ATGTCCAGTT CTATGCCACC CGCCGCGTCG GCGGCCAGGT CATGGACAGC GACGCGATCA AGCTGCTGAA GATCACCGCC TGA
|
Protein sequence | MTTEIETKAD PLAGSFDIVE RQDAHDAELA ALRSELTEVK GRLEKAGRLA LRAPLAGGEP GGAADAGFVD GYIRHGRETE LKSLSGAVAV DGGYAIPQKI DEMIARRMVE ISPIRSIANV VRTGTSGFRR LVSTGGTASG WVSETGARPE TASPKLAEIA PPSGELYANP SATQAMLEDA AFDVESWLAG EIATEFARAE GAAFVSGTGT NQPKGFLAAT TSAAGDSSRA FGSLQFIGSG NASGFDTTPE AKLIDLVCQL KAPLRQGAAW VMNSTTLAAV RKLKTADGAF LWQPGMVEGQ PDRLLGYPVV EAEDMPDVAA NQFPIAFGNF RAGYLVTDRR QTTILRDPYT NKPYVQFYAT RRVGGQVMDS DAIKLLKITA
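The sequences above are printed in blocks of 10 characters separated by spaces, so they need light cleanup before any length or composition check. The following is a minimal sketch of that cleanup. The helper names (`clean`, `gc_fraction`, `looks_like_cds`) are hypothetical and not part of any database tooling. The stop codons used are the standard TAA, TAG and TGA, which also apply under translation table 11.

```python
def clean(seq: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def looks_like_cds(dna: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    dna = clean(dna)
    return len(dna) % 3 == 0 and dna[-3:] in stops


# Usage on the first two blocks of the gene sequence as printed above.
fragment = "ATGACTACCG AAATCGAAAC"
print(len(clean(fragment)))   # prints: 20
print(gc_fraction(fragment))  # prints: 0.4
```

Applying `clean` to the full Gene sequence and Protein sequence fields and comparing the resulting lengths with the Gene Length and Protein Length fields gives a quick check that nothing was lost when the record was copied.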