Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0647 |
Symbol | |
ID | 3918072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 686369 |
End bp | 687673 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640443378 |
Product | Phage portal protein, HK97 |
Protein accession | YP_495928 |
Protein GI | 87198671 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.239357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCAGTC TGGCCCCGCG AACGTCCTGG CTCGCGAAGG CGACGACGGC GCTGCGCGAC TGGCTGGTGC TCGGACCGGA TCCGAAGATC CGCGACCCAC AGAATTCGTC GCGTGGGGGG AACGGGGCGG GCGTAACCGT CAATGACCAG GCTGCGATGC GGCTGAGTGC ATTCTGGGGC TGCGTCCGCC TCATCTCGTC TACGATCGGC TCGCTACCGG TCCCGGTCTA TACCGTCGAC CAGCGCGGAG TTCGCTCGGT CGCTCGGGAG AGTGCACTGT ATCGTGTGTT GCACGATAGC CCGAACGCTG ACCAGACGCC GGTCGATTAC ATGGAATGCG CCGTCATTTC GCTACTTTTG CGTGGCAACC ACTACGCCCG CAAGCTGATG GAAGGTGGCC GACTGGTTGG CCTCGAGCCG ATCAATCCCG CGATCGTCAG CGTCCGCCGG CGTTCCGATG GCAGGATTGG GTATCGTTGG ACCGAAGGCG GTGAAAACTT CGACCTTACC GAGGATGAAG TTTTTCACGT TCGTGGATTC GGCGGCGGGC CGCTGGGCGG GCTTTCGACT GTCGAGTTTG CACGTGAATC GCTGGGCGTA GCGATCGCTG CGGACCGCGC CGCGAGCGCG ATTTTCGCCA ACGGGGTGAA CCCGACAGGA ATCATGTCGA CTGATATGCC GCTGACGGCT GCGCAGCAGG CAGAGGCAGA GGAGTTGATC GTCAAGAAGT ACCAGGGAGC GCACCGCATG GGTGTCCCGA TGGTGCTTGG CCACGGGTTG AAGTGGAATT CAATCACGAT GAAGGCCGAC GACGCCCAGC TGCTGCAAAG CCGGGGTTGG AGCGTAGAGG AGATTTGCAG GTGGTTCGGC GTTCCGCCGT TCATGATCGG TCACAACGAG AAGACCACGA GTTGGGGTAC CGGCATCGAG CAGATGCTGC TGGGCTTCCA GAAATTTACT CTCAATCCCT ACCTGCGACG CATTGAGCAG GCTGTGCGCA AGCAGCTGAT CACTCCGATC GAGCGTGCCC GTGGTCTGAC CGCCGAATTC AATCTTGAAG GCCTCCTGCG GGCCGACAGC GCGGGTCGCG CATCGTTCTA CGACAAGGCG CTCAAGTCGA AGTGGATGGT CATCAACGAA GTCCGGGCAA AGGAGAACCT TGCGCCGGTG CCGTGGGGCG ATGAGCCGAT CGTGCAGCAG CAGGACGTGC CGCTGTCCGA TCAGCTCGAT GCCCTCCGGG AAGCAATCAA GAACGCCCAG GACGTGGCCG GGCTGTTCCA GAAGGGAAAC GCCAATGCAG CGTAA
|
Protein sequence | MSSLAPRTSW LAKATTALRD WLVLGPDPKI RDPQNSSRGG NGAGVTVNDQ AAMRLSAFWG CVRLISSTIG SLPVPVYTVD QRGVRSVARE SALYRVLHDS PNADQTPVDY MECAVISLLL RGNHYARKLM EGGRLVGLEP INPAIVSVRR RSDGRIGYRW TEGGENFDLT EDEVFHVRGF GGGPLGGLST VEFARESLGV AIAADRAASA IFANGVNPTG IMSTDMPLTA AQQAEAEELI VKKYQGAHRM GVPMVLGHGL KWNSITMKAD DAQLLQSRGW SVEEICRWFG VPPFMIGHNE KTTSWGTGIE QMLLGFQKFT LNPYLRRIEQ AVRKQLITPI ERARGLTAEF NLEGLLRADS AGRASFYDKA LKSKWMVINE VRAKENLAPV PWGDEPIVQQ QDVPLSDQLD ALREAIKNAQ DVAGLFQKGN ANAA
|
| |