Gene Information |
Locus tag | Saro_0646 |
Symbol | |
ID | 3918071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 684636 |
End bp | 686372 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640443377 |
Product | phage terminase |
Protein accession | YP_495927 |
Protein GI | 87198670 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
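The coordinate and length fields above can be cross-checked against one another. The minimal sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 684636
end_bp = 686372

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1737, matching "Gene Length"
print(protein_length)   # 578, matching "Protein Length"
```

Under these assumptions, the reported 1737 bp and 578 aa agree with the start and end coordinates.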
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0135346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGGC CAGTGTGGTC GACTGCCTGC CAGGATTGGC GGGAGCGCAT CGTGGCGCGC GAGAGCATCG CGCCATGCGG GCCGCTGTTC CCGGCGAAAG CCGCGGATGC GTTGGGGGTG TTCAAGTCAC TGCAGGTCAC CGACCTGCCT AAAAGGGAGA ATGGGACCCA CCCGACGTTG GGTGAAGTCT GCGACCAGTT CGTTTTTGAC CTGGTTGCGG CCATCTTCGG CGCCGAGGAT CCGGAAACTG GCGAGCGGTT GATCAAGGAG TTCATGCTCC TGATCAGCAA GAAGAATGGC AAGTCGATGA TCGCAGCGGG CATCATGGTG ACCGCGTTGA TCCTCAACTG GCGTCCGAAC GCGATCCTGC AGATCCTTGC GCCCACGATC GAGGTTGCGA ACAACAGCTT TGAGCCTGCC ATGGGGATGG TAAGAGCGGA CGCCGAGCTG GCGATTGTCC TGAAGGTTGT CGAGCATCAG CGACAGATCA AGCACCTGAC GACGGGTGCG GTATTGCGGG TCATTGCCGC CGACAGCGAT ACCGTAGCTG GCGGCAAGGC GGCGATGACG TTGATCGAAG AGCTCTGGCT CTTCGGGAAG AAGGCCAAGG CAGCCGCGAT GTTGCGAGAG GCGCTGGGTG GCGGATCGGC AAGGCCGGAA GGCTTTACGC TCTACATCAC GACCCATTCG GACGAACCAC CGGCGGGCGT GTTCAAGACC AAGCTGTCGT ACTTCCGCGA TGTGAGGGAT GGTGAGATCG AGGATCCTGC TACCTTCCCG ATGCTGTACG AGTGGCCGGA AGACCTGTTG GAGTGCGAGT CCTACCTCGA TCCAGAGTTC TTCTACGTCA CGAATCCACA CGTTGGCCGA TCGGTCTCGA TTGAATGGCT CAAGTCGGAA CTGCAGAAGG AACAGATCGG CGAAGGTGAA GGCCTTCAGA TCTTCCTGGC GAAGCATCTG AACGTCGAGA TTGGTTTGCG GTTGCGGCGC GATCGTTGGG GCGGTGCTGA GCTTTGGCTC GATGCAGCGA ATGATGACCT TGATCTCGAC CAGCTGCTTG AGCGCTGCGA AGTGGCGATC GTTGGGCTCG ACATGGGCGG CCGGGACGAC TTGGCCGGTG CCGGCGTGGT CGGTCGCGAA AAGGGAACGG GTATATGGCT GGGCTGGGCG CATGCCTGGG CGCAGCGGGT TGCGCTGGAG CGGCGCAAGC AGGTGGCGCC GACGCTGCAA GGCTTTGCGG CTGAAGGCGA CCTGACCTTC ACCGATTCCG GTGAGGAAAT CGTGAGCGCC ATGGCGCGCC TTGCAATTCG GGTCCGCGAC AGCGGCAAGA TGCCTGCGGA TGGCGGGGTT GCGGTCGATG CCTGGGGCAT CGGTCCACTC GTCGATGCGC TGGTGCAGGC CGGGTTCGAT CCTGGCGACG AGGCAATGAA GCGCGCGGGG CATATCGCCT CGATCAGGCA GGGTGTTGGC CTGTCGAGCG CGATCTACAC GCTGGAATTC AAGCTCGGCG ACGGGATGTT CCGTCACGAC GGTTCGAACA TGATGGCCTG GTGCGTGAGC AACGCGCTGG TCAAGCTCAG GGGCAGTGCC TTGTACGTCG ACAAAGAGAC ATCAGGCGCG GGCAAGATCG ACCCGTTCGT GGCGCTGCTC AATGCAGTGA AGCGTATGGA AGAGGGCCCG GTGGCCGTGG CTGGCGGCGT CGATAGCTGG CTCGCCAGCT TGCGTGGTGC GGCGTGA
|
Protein sequence | MARPVWSTAC QDWRERIVAR ESIAPCGPLF PAKAADALGV FKSLQVTDLP KRENGTHPTL GEVCDQFVFD LVAAIFGAED PETGERLIKE FMLLISKKNG KSMIAAGIMV TALILNWRPN AILQILAPTI EVANNSFEPA MGMVRADAEL AIVLKVVEHQ RQIKHLTTGA VLRVIAADSD TVAGGKAAMT LIEELWLFGK KAKAAAMLRE ALGGGSARPE GFTLYITTHS DEPPAGVFKT KLSYFRDVRD GEIEDPATFP MLYEWPEDLL ECESYLDPEF FYVTNPHVGR SVSIEWLKSE LQKEQIGEGE GLQIFLAKHL NVEIGLRLRR DRWGGAELWL DAANDDLDLD QLLERCEVAI VGLDMGGRDD LAGAGVVGRE KGTGIWLGWA HAWAQRVALE RRKQVAPTLQ GFAAEGDLTF TDSGEEIVSA MARLAIRVRD SGKMPADGGV AVDAWGIGPL VDALVQAGFD PGDEAMKRAG HIASIRQGVG LSSAIYTLEF KLGDGMFRHD GSNMMAWCVS NALVKLRGSA LYVDKETSGA GKIDPFVALL NAVKRMEEGP VAVAGGVDSW LASLRGAA
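As a rough consistency check between the two sequences, the sketch below translates the first three 10-base blocks of the gene sequence and compares the result with the start of the protein sequence. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling.

```python
# Codon table in the compact NCBI layout: bases in the order T, C, A, G.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
fragment = "ATGGCGCGGC CAGTGTGGTC GACTGCCTGC"
prefix = translate(fragment)
print(prefix)  # MARPVWSTAC, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow comparison against the complete protein sequence and the reported GC content.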