Gene Information |
Locus tag | Saro_3004 |
Symbol | |
ID | 3917440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3221762 |
End bp | 3223216 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640445783 |
Product | Phage uncharacterized protein-like |
Protein accession | YP_498273 |
Protein GI | 87201016 |
COG category | [S] Function unknown |
COG ID | [COG5410] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain |
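The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of this database.

```python
def feature_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length_bp):
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
gene_length = feature_length_bp(3221762, 3223216)
protein_length = expected_protein_length_aa(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the two printed values can be compared directly with the Gene Length and Protein Length rows.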
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCTGA CGCCTGACGA CCTGAAAGCC TGCGAGGTAG AGCTTGCCCG CCGTTCACTG GCCGACTTTG CGCGCATGGC GTGGCCGGTT CTGGAACCTG CAACTCCGCT CAAGTGGGGA TGGGCCTTGG ACGCCATCTG CGAACATCTG GAGGCAGTTT CGAGAGGAGA GAGCAAGCGC TTGCTGATGA ACGTGCCGCC CGGCTCTATG AAGTCGCTTT TGACAGGCGT TATCTGGCCT GCGTATGAGT GGGGGCCGAT GGACCATGCC GAGATGCGCT TCCTCGGCAC AGCGCACAAG CAAGACCTTG CTGTCCGCGA CAACCTGAAG TGCCGTCGCC TGATCCAGTC GCAATGGTAT CAGGAGCGGT GGCCCGTTGT CCTGACCAGT GACCAGAACG CCAAGACCAA GTTCGAGAAC GCGCGAACAG GCTTTCGCGA GGCTATGGCA TTCGAGAGCA TGACCGGTTC GCGTGGTGAC AGGGTGATCT TGGACGACCC ACACAGCGTG GACGATGCGA ACAGCGCGGC CAAACTTGCC AGTGGCGTGA CGACGTTTCG TGAGGCCCTG CCAAGCCGTG TGAACAATGA TCAGTCTGCA ATCGTGATCG TCATGCAGCG CTTGAACGAG GCGGACGTAT CGGCGGTAGC AATCGACCTC GGCTACGATC ATCTTTGCAT TCCGATGCGA TATGAGCCGG GACGGTCGAA GTGGGTCTAT GGCTCCGGTG ATCCGCGCAA GGAAGAGGGG GAGTTGATGT TCCCCGAGCG CTTCCCCGAG GAACAGGTAT CAGAGCTTGA GAAGACCATG GGCAGCTATG CCGTTGCGGG CCAGCTGCAG CAGCGCCCTG CGCCGCGTGG TGGTGGTATC ATCAAGACTG CATGGTTCCG CTCATATCGG GAGCTTCCTG CCTTGGAGTG GCGGCAAATC CATGCAGACA CCGCGCAAAA GACGGGCGAG GAAAACGACT ACAGCGTGTT CCAGTGCTGG GGCCGAACGA CAACCGGCCA GGCGGTATTG ATCGACCAGA TACGCGGCAA GTGGGAGGCT CCCGACCTCC TGACGCAGGC CCGCGCATTC TGGTTGAAGC ACAAGAGCAT TCCCGGCCCG GTTCTTCGCG CGATGAAGGT CGAGGACAAG GTATCTGGCA CGGGCTTGAT CCAGACGTTG CGCCGGGAAG GCGTTGCTGT AATCCCGGTG CAGCGCAACA AGGACAAGAT CAGCCGCGCA TATGATGCTG CCGCGTTCAT CGAAAGCGGT AACGTGCTGT TGCCCGAGTG GGCCGATTGG CTGGATGGCT TCACCAATGA GGCGGCGACC TTCCCGAGCG GCGCGCACGA TGACCAGCTC GATCCGATGT TTGATGCGAT CCATGACGTG CAGTTCGGCA TGACAGTTAG TGCAGCAGTG ACCCGGCCTA TCCCTAGATC GGTAACAGCG TTCAACAAGC GTTAG
|
Protein sequence | MLLTPDDLKA CEVELARRSL ADFARMAWPV LEPATPLKWG WALDAICEHL EAVSRGESKR LLMNVPPGSM KSLLTGVIWP AYEWGPMDHA EMRFLGTAHK QDLAVRDNLK CRRLIQSQWY QERWPVVLTS DQNAKTKFEN ARTGFREAMA FESMTGSRGD RVILDDPHSV DDANSAAKLA SGVTTFREAL PSRVNNDQSA IVIVMQRLNE ADVSAVAIDL GYDHLCIPMR YEPGRSKWVY GSGDPRKEEG ELMFPERFPE EQVSELEKTM GSYAVAGQLQ QRPAPRGGGI IKTAWFRSYR ELPALEWRQI HADTAQKTGE ENDYSVFQCW GRTTTGQAVL IDQIRGKWEA PDLLTQARAF WLKHKSIPGP VLRAMKVEDK VSGTGLIQTL RREGVAVIPV QRNKDKISRA YDAAAFIESG NVLLPEWADW LDGFTNEAAT FPSGAHDDQL DPMFDAIHDV QFGMTVSAAV TRPIPRSVTA FNKR
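The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain uppercase A/C/G/T input. The helper names are hypothetical, and the demonstration uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content listed in the record.

```python
def join_blocks(display_sequence):
    """Remove the whitespace between display blocks and normalize to uppercase."""
    return "".join(display_sequence.split()).upper()


def gc_fraction(sequence):
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First three display blocks of the gene sequence, copied from the record.
first_blocks = "ATGCTCCTGA CGCCTGACGA CCTGAAAGCC"
cleaned = join_blocks(first_blocks)
fraction = gc_fraction(cleaned)
print(len(cleaned), round(fraction, 2))
```

Passing the full Gene sequence field to the same two functions would give a value to compare against the GC content row.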