Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2174 |
Symbol | |
ID | 5713827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2301398 |
End bp | 2302615 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641268096 |
Product | putative phage major capsid protein |
Protein accession | YP_001533511 |
Protein GI | 159044717 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACA GCGATCCCTC TGCCTTGCTC GCAGCAAAGA CGGCCCCTGA CGCCGTGGCG GACGGGGTGT CTCCGCTCGC CGAAGTCACC GCAGAACTGA GTGCCTTCGT GCGCGAAGTC ACGGCCCAGA AGGCCGAAAT GAAATCCCGA TTGAAACAAC AAGAAGAGCG TCTCCTCATG CTTGACCGCA AATCTACTTC GTACGCGCGC CCCGCGCTGT CTGGTGCTGT TGCCGAAGCA GCGCCCCATC AGAAGGCGTT CGACGCCTAT CTGCGTTCCG GCGATGACGA CGCGCTGCGT GGTCTGGAAC TGGACGGCAA GGCCATGTCC ACGGCGGTGT CCGGCGAGGG CGGCTATCTC GTCGATCCGC AGACCTCCGA GACCGTGAAG AACGTCCTGA AGTCCCAGGC CTCCCTGCGG GCGGTCGCAC AGGTCGTGAC GGTGGAAGCC ACCAGCTATG ATGTGCTGGT CGATCACACC GAGGTCGGCA CCGGCTGGGC GAGCGAGACC GGCAACCTGA CCGAGACCGA CACCCCGCAG ATCGACCGGA TCTCGATCCC GCTGCACGAG CTTGCTGCCC TGCCGAAGGT CAGCCAGCGA CTGCTGGATG ATGCGGCGTT CGACATCGAG GCATGGCTGG CTGAGCGCAT CGCCGACAAG TTCGCCCGCT CGGAAGCGGC GGCCTTCGTC AACGGAGACG GCGCCGACAA GCCCAAGGGG TTCCTTACCC ATGACAGCGT CGACAACGAT GTGTGGACCT GGGGCAATCT CGGCTACGTG GTCACCGGCT CGGATGGTGA TTTCAACAGC GTGAGCCCCG CAGATGCGAT CGTGGACCTG GTTTACGCGC TGGGCGCGCG CTACCGCGCC AATGCCAGCT TCATCATGAA CTCCAAGACC GCTGGTGTCG TGCGCAAGAT CAAGGATGCC GATGGCCGGT TCCTGTGGTC GGACGGCCTT GCCGCGGGGG AGCCGGCGCG TCTGCTGGGC TACCCGGTGC TGATCTCCGA GGACATGCCG GATATCGGCA GCGATGCCAC GGCCATCGCC TTTGGCGATT TCGGCGCCGG CTACACCATC GCGGAGCGTC CGGACCTGCG CATCCTGCGC GATCCGTTCT CGGCCAAGCC TCACGTTCTG TTCTACGCCA CCAAGCGCGT GGGCGGGGAT GTGAGCGATT TCGCGGCGAT CAAACTGCTG AAATTCGCGG TGAGCTAA
|
Protein sequence | MSNSDPSALL AAKTAPDAVA DGVSPLAEVT AELSAFVREV TAQKAEMKSR LKQQEERLLM LDRKSTSYAR PALSGAVAEA APHQKAFDAY LRSGDDDALR GLELDGKAMS TAVSGEGGYL VDPQTSETVK NVLKSQASLR AVAQVVTVEA TSYDVLVDHT EVGTGWASET GNLTETDTPQ IDRISIPLHE LAALPKVSQR LLDDAAFDIE AWLAERIADK FARSEAAAFV NGDGADKPKG FLTHDSVDND VWTWGNLGYV VTGSDGDFNS VSPADAIVDL VYALGARYRA NASFIMNSKT AGVVRKIKDA DGRFLWSDGL AAGEPARLLG YPVLISEDMP DIGSDATAIA FGDFGAGYTI AERPDLRILR DPFSAKPHVL FYATKRVGGD VSDFAAIKLL KFAVS
|
| |