Gene Information |
Locus tag | Dshi_2255 |
Symbol | |
ID | 5713908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2376242 |
End bp | 2378218 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641268177 |
Product | putative capsule polysaccharide export protein |
Protein accession | YP_001533592 |
Protein GI | 159044798 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
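The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, which is not translated. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2376242, 2378218)
print(length)                  # 1977, matching "Gene Length"
print(protein_length(length))  # 658, matching "Protein Length"
```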
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.729539 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGCGTTT CTTCGCTCAG CCACCTGCGG GATCGGGACC TCCGCCGCAT TCTCATCCTT GCGGGCTCGC GATTGACCAT GGCCAGACCG GGGACCACAC GCCCGCTTGC GCTTTGGGGC AATGGCGGCA AGAGCGCGCG GCGCGGAACC GCTCTTGCCA GATGCCTCGG CGCGCCTTGC GTCTATCTCG AAGACAGCTT CCTGCGCTCG GTTCTGCCCG GGCGCGCGGG CGCACCGGTC CATGGTCTCA CCTTCGACCG GCAACGCCCG TTTTATGACA GCACCGGTCC CAGCGATCTT GAGGACATGC TCAACACCGC GCCCCTGTCG GACACCGTGC GGGGGAAGGC GATTATAGAC GCGCTGATCG AAGCCGATCT GAGCAAGTAC AACACCCACG ATCCGACCCT GCCCGCTCCG GCGGGGCCCT ATGTCCTCGT GATCGACCAA CTGCGCGGCG ATGCATCGAT TGCCGGCGCG GGCGCGGATG CCGCGCGTTT TACCGAGATG CTGGAGGCCG CACGCGCAGA CCACCCGGGC AAGCAGATCA TCGTGAAGGG TCACCCGGCA GCGACCGACG CGCGGCCGGG GCATTTTGGC TCCGCATTTT CGGACCCGGT CTCGCCCTGG ACCCTGATCC GGGGCGCCGA TGCGGTCTAT ACGGTCAGCT CGCAGATGGG GTTCGAGACG ATCCTATCCG GCAAGCGGCC GCAGATCTTC GGCACACCTT TCTATGCGGG CTGGGGTCTC AGCGACGATC GCGTCCCGGT TCCCCGGCGG AAGCGAACTC TCGATCGCGC GCAACTTGCA CTGGCTGTGC TCGGCCTCTA CCCGATCTGG TATGACCCCA GCCGCGACCG ACTCTGCACG GTCGAAGAGG TGATGGCGGC CCTTGCAGCC CGCGCCCGCG CCTGGCGGCA GGACCGGGCC GGATATGTCG CCCAGGGCAT GCGCCTGTGG AAACGCGCAT CCATGACCGC CTTCCACGGG GACGGACCTG TGCGCTATGT CTCTGGCGAA CAAGAGGCAC TGGACCTTGC CGACCGCAGC GGTCGCCCGA TCCTGAGCTG GGCCAGCCGC ACGCCACCGG CCATTGTCGA TGCGGCGCGC CAGCGCGATA TGCCTCTGCT GCGGGTCGAG GATGGCTTCC TGCGCTCCCG CGGACTGGGC GCGAACCTCG TGCCGCCGCT GTCCCTCGTC CATGACCGGC GCGGCATTTA CTATGACCCG ACTGTCCCCA GCGATCTGGA GCATCTGATT GCGCGCCGTG CCGCAATCGG CAGCACTGCC CGTGGGCGCG TCCTGCTGCA CCGGCTGCGG GCCTCCGGAC TGAGCAAATA CAATCTCGAC CTGCCCGGCT ACAGCCCGCC TGACGCGCAC CGCCGCTGCA TCCTGGTCAT CGGACAGGTC CGCGATGATG CCTCCGTGTT GCTCGGACAG AGTGGGGTTA TCCCCGACAT CGAACATTTG CTCTTGGCCG CCCGAACCGC CAATCCCGAC GCCCATATCG TCTACAAACC CCACCCGGAC GTGGAGGCTG GCCTACGAGA CGGGTCGATC GAGAGCGAAC AGGCCAATGA GATCGCGCGG CGGGTCGACC CGGTATCGGT ACTGGCAGCC GCGGCGGAGG TCTGGACCCT GTCATCGCTG ATGGGGTTCG AGGCACTTCT TCGGGGCAAG CCGGTGGTCT GCGCGGGGGT TCCTTTCTAT GCGGGCTGGG GTCTAACCCG AGACCTCGCC TCTGCCGATC ACCCGGCCTT CGCGCGGCGA ACGGCTCGAC CGGATCTGGC CGCACTGGTC 
GAAGCATGTC TGATCGACTA CCCGCGCTAT CACGATCCAG TCTCGAACCT GCCCTGCCCT GTCGAAACCG TACTCGATCG GCTGGAACAG GAAACCGAAG CAGCGCATAA GCCGTCCCTG CGTTTGCTTG CCAAGCTGCA GGGACTTCTC GCGTCGCAAT CGTGGATCTG GCGTTAG
Protein sequence | MCVSSLSHLR DRDLRRILIL AGSRLTMARP GTTRPLALWG NGGKSARRGT ALARCLGAPC VYLEDSFLRS VLPGRAGAPV HGLTFDRQRP FYDSTGPSDL EDMLNTAPLS DTVRGKAIID ALIEADLSKY NTHDPTLPAP AGPYVLVIDQ LRGDASIAGA GADAARFTEM LEAARADHPG KQIIVKGHPA ATDARPGHFG SAFSDPVSPW TLIRGADAVY TVSSQMGFET ILSGKRPQIF GTPFYAGWGL SDDRVPVPRR KRTLDRAQLA LAVLGLYPIW YDPSRDRLCT VEEVMAALAA RARAWRQDRA GYVAQGMRLW KRASMTAFHG DGPVRYVSGE QEALDLADRS GRPILSWASR TPPAIVDAAR QRDMPLLRVE DGFLRSRGLG ANLVPPLSLV HDRRGIYYDP TVPSDLEHLI ARRAAIGSTA RGRVLLHRLR ASGLSKYNLD LPGYSPPDAH RRCILVIGQV RDDASVLLGQ SGVIPDIEHL LLAARTANPD AHIVYKPHPD VEAGLRDGSI ESEQANEIAR RVDPVSVLAA AAEVWTLSSL MGFEALLRGK PVVCAGVPFY AGWGLTRDLA SADHPAFARR TARPDLAALV EACLIDYPRY HDPVSNLPCP VETVLDRLEQ ETEAAHKPSL RLLAKLQGLL ASQSWIWR
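The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that clean-up, with a GC-content calculation and a stop-codon check. It assumes the standard stop codons of translation table 11 (TAA, TAG, TGA), and the helper names are hypothetical. Only the first two blocks and the final block of the gene sequence are used here, so the printed GC value describes that 20-base fragment and not the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocks: str) -> str:
    """Remove the whitespace separating the display blocks and upper-case the result."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


fragment = clean_sequence("ATGTGCGTTT CTTCGCTCAG")  # first two blocks of the gene
print(len(fragment))               # 20
print(gc_percent(fragment))        # 50.0 (fragment only, not the full gene)
print(fragment.startswith("ATG"))  # True: the gene begins with a start codon
print(ends_with_stop(clean_sequence("GCGTTAG")))  # True: the final block ends in TAG
```

Applying `gc_percent` to the full cleaned gene sequence gives a value that can be compared with the "GC content" field of the record.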