Gene Information |
Locus tag | Saro_0649 |
Symbol | |
ID | 3918074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 688371 |
End bp | 689768 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640443380 |
Product | Phage major capsid protein, HK97 |
Protein accession | YP_495930 |
Protein GI | 87198673 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
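The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 688371
END_BP = 689768


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1398
print(protein_length(length_bp))  # 465
```

Both results match the Gene Length (1398 bp) and Protein Length (465 aa) fields.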
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.296668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGA ACCTGATTGC TGCGGCCCTT GTGGTCGCGG TATCGCTGGT GCTCGCTCCG AGCGCCGCCT TTGCTGCCAC TTTCGCGCAG CAGATCCATC CGTCGCTGGA CATCAACGGT GCTCTTGCCG CGATCGGCAT GCTGGCGGCT GTTGGCAGCA TCAGCGAGTT CGGCCGCAAG CAGGCAGGCG AAGGTGAGGG TGAGCTCAAG CAGCTCGCGG TCGACCTGAA GTCTGCGACG GACCAGGTAA AGACCTTCGC CGAGAAGGCG GATGCGGAAA TGAAGCGTCT CGGCACGGTC ACCGAGGAAA CCAAGAAGAG CGCAGACGAA GCGCTCATCA AGATGAACGA AACGACCGCT CGAATCGATG CGATCGAGCA GAAGCTCGCT CGCCGCGGCG AAGAGGGCGA GAAGCGCCGC GCCAAGACGG CCGGCCAGGA AGTGACCGAG AGCGAGGAGT TCAAGGCTTG GCTCGGCGGC AACCGCAAGA ACACGTTCAG CATGCAGGTG AAGGCGATCA TTTCGTCGCT CACCACCGAC GCTGATGGCT CGGCGGGCGA CCTGATCGTT CCGCAGCGCC AGCCCGGTAT CATTGGCCTG CCGCAGCGCC GCATGACGAT CCGCGACCTG CTCACGCCGG GCAACACCGG TTCGAACGCG ATCCAGTACG TGAAGGAAAC CGGCTTCACC AACAACGCTG CCACCGTGAC TGAAACCGCC GGCACGGCGA AGCCGCAGTC GGAGATCAAG TTCGACATCG TCACCAGCTC GGTCACGACG ATCGCTCACT GGGTGCTTGC GACCAAGCAG ATCCTCGACG ACGTGCCGCA GCTGCGCTCA TACATCGATG GCCGTCTGCG TTATGGTCTG GAGTACGTCG AAGAAGGGCA GCTGCTCAAC GGTGGCGGCA CTGGCACCGA TCTCAACGGC ATCTACACCC AGGCAACGGC TTTCGCGGCG CCAATCACCC CCACCGCCGC CGGCATGATG ACGAAGATCG ACATCATTCG TCTCGCCATT CTTCAGGCAG CTCTCGCGGA ACTGCCGGCC AACGGCATCG TGATGCACCC CAGCGATTGG GCTGACATCG AGCTGACCAA GACCGATGAT GGCGCTTACC TGTTCGCCAA TCCGCAGGGT GGCAGCGAGG CCCGACTGTG GCGCCTGCCT GTCGTCGAAA CGCAGGCGAT GACCGTCGAC AAGTTCCTTA CCGGAGCTTT CCAGATGGGT GCGCAGGTGT TCGATCGCGA AGAAGCCAAC GTCGAGATCT CGACTGAGGA CAGCGACAAC TTCCGCAAGA ACCTGGTCAC CATTCGCGCC GAGGAGCGTC TCGCGCTCGC GGTCTATCGG CCGGAAGCCT TCATCAAGGG CGACTTCAGC GACGCGCTGG CACTCTGA
|
Protein sequence | MKKNLIAAAL VVAVSLVLAP SAAFAATFAQ QIHPSLDING ALAAIGMLAA VGSISEFGRK QAGEGEGELK QLAVDLKSAT DQVKTFAEKA DAEMKRLGTV TEETKKSADE ALIKMNETTA RIDAIEQKLA RRGEEGEKRR AKTAGQEVTE SEEFKAWLGG NRKNTFSMQV KAIISSLTTD ADGSAGDLIV PQRQPGIIGL PQRRMTIRDL LTPGNTGSNA IQYVKETGFT NNAATVTETA GTAKPQSEIK FDIVTSSVTT IAHWVLATKQ ILDDVPQLRS YIDGRLRYGL EYVEEGQLLN GGGTGTDLNG IYTQATAFAA PITPTAAGMM TKIDIIRLAI LQAALAELPA NGIVMHPSDW ADIELTKTDD GAYLFANPQG GSEARLWRLP VVETQAMTVD KFLTGAFQMG AQVFDREEAN VEISTEDSDN FRKNLVTIRA EERLALAVYR PEAFIKGDFS DALAL
|
| |
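The GC content and protein sequence fields can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and uses the standard codon assignments, which translation table 11 shares with table 1 for elongation. It does not handle alternative start codons such as GTG or TTG; that is not an issue here because this gene starts with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above; a trailing partial codon is ignored.
prefix = "ATGAAGAAGA ACCTGATTGC"
print(translate(prefix))  # MKKNLI
```

The output for the prefix agrees with the first six residues of the protein sequence field. Passing the full gene sequence to `gc_content` and `translate` would allow the remaining fields to be checked in the same way.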