Gene Information |
Locus tag | Saro_3119 |
Symbol | |
ID | 3918161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3332560 |
End bp | 3333729 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445903 |
Product | Phage portal protein, HK97 |
Protein accession | YP_498388 |
Protein GI | 87201131 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
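The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3332560, 3333729)
print(length)                  # 1170, matches "Gene Length"
print(protein_length(length))  # 389, matches "Protein Length"
```

With these assumptions, 3333729 - 3332560 + 1 = 1170 bp, and 1170 / 3 = 390 codons, one of which is the stop codon, leaving 389 residues.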
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTCT TTCAAACGCT TGCCGCCGCC TTCAAGGGCG AGGCGGTGAC GCCGCGTGCG CCGCTTGGAC GCACCTTCGT ATCGCCCTGG ATCGCGGCTT CGGACTGGCG GGGCGAGACC TCGCGCGGGC CGATCAACTA TCCGGTGGCA GTGCGCGAAG CCTATCTCAG GAACCCGGTG GCGCAGCGCG CGGTGCGGCT GGTTGCGGAA GGCATCGGCG GCGCACCGAT CATCGCATCG GCCGCGGAGC TGCGGGCCCT GGTCGGCGAG ACGAGCGCGG GGCAACCCCT GCTGGAGACG CTGGCTGCGC ACCTGCTGTT GCACGGCAAC GGTTACGTCC AGATTCTGCG CGACGCGGAC GGGGTGCCGG TGGAACTGTT CGCGCTGCGG CCGGAGCGCG TAACGGTGCT GCCGGACGCG AGCGGATGGC CGGCGGCCTT CGCCTATCGC GTGGGCGAGC GGACGATGCG GATCGAGGCG GTGGACGACC TTGACCAGCC GAACCTGATC CACGTGCGCC ACTTCCACCC GCTGGACGAC CACTATGGCG CGGGCTGCCT GGAGGCCGCC GAGGAAGCGG TCGCGATCCA CAACGCGGCA GCGCGGTGGA ACCGGTCGCT GCTGGAAAAC GCCGCGCGTC CTTCGGGTGC ACTCGTCTAT GACCCCGGCG ATCCGGGAGC GGCGCTTTCG GCCGATCAGT TCGAACGGAT CAAGGCCGAG CTGACTGCCG CCTATTCGGG CACCGGCAAT GCCGGACGCC CGATGCTGCT GGAAGGCGGC CTGAGGTGGC AGGCGCTGGC GCTGACGCCG GCGGACATGG ACTTTGCGAC GCTGAAGGCC GCGGCGGCGC GCGACATCGC GCTGGCCTTT GGCGTGCCGC CAATGCTGCT CGGCATTCCG GGCGACAACA CTTATGCAAA TTACCGCGAG GCCAATCGCG CACTGTGGCG ACTGACGCTG TTGCCGGTTG CGGGCAAGAT CCTTGCCGCG CTGGCTGAAG GGCTGCGATC GTCCTTCGCG GAGGCGAGCC TGGCCGTGGA CATGGACGGC ATTCCGGCCC TGGCCGAGGA TCGCGAGAGA CTGTGGTCGC AGGTGAGCGG CGCGGATTTC CTTTCCGCCG ACGAAAAGCG GGCAATGCTT GGCCTTGACC GGAACGGGGT GGCGGCATGA
|
Protein sequence | MSFFQTLAAA FKGEAVTPRA PLGRTFVSPW IAASDWRGET SRGPINYPVA VREAYLRNPV AQRAVRLVAE GIGGAPIIAS AAELRALVGE TSAGQPLLET LAAHLLLHGN GYVQILRDAD GVPVELFALR PERVTVLPDA SGWPAAFAYR VGERTMRIEA VDDLDQPNLI HVRHFHPLDD HYGAGCLEAA EEAVAIHNAA ARWNRSLLEN AARPSGALVY DPGDPGAALS ADQFERIKAE LTAAYSGTGN AGRPMLLEGG LRWQALALTP ADMDFATLKA AAARDIALAF GVPPMLLGIP GDNTYANYRE ANRALWRLTL LPVAGKILAA LAEGLRSSFA EASLAVDMDG IPALAEDRER LWSQVSGADF LSADEKRAML GLDRNGVAA
|
| |
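The sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string could be normalized and its GC percentage computed is shown below. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record, so its result is not expected to match the reported 69%.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace between blocks and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
example = "ATGTCGTTCT TTCAAACGCT"
print(len(normalize(example)))  # 20
print(gc_percent(example))      # 40.0
```

Applying `gc_percent` to the full gene sequence would give the value to compare against the "GC content" field, which the record reports rounded to a whole percent.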