Gene Dshi_2255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2255 
Symbol 
ID5713908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2376242 
End bp2378218 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content66% 
IMG OID641268177 
Productputative capsule polysaccharide export protein 
Protein accessionYP_001533592 
Protein GI159044798 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.729539 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGTTT CTTCGCTCAG CCACCTGCGG GATCGGGACC TCCGCCGCAT TCTCATCCTT 
GCGGGCTCGC GATTGACCAT GGCCAGACCG GGGACCACAC GCCCGCTTGC GCTTTGGGGC
AATGGCGGCA AGAGCGCGCG GCGCGGAACC GCTCTTGCCA GATGCCTCGG CGCGCCTTGC
GTCTATCTCG AAGACAGCTT CCTGCGCTCG GTTCTGCCCG GGCGCGCGGG CGCACCGGTC
CATGGTCTCA CCTTCGACCG GCAACGCCCG TTTTATGACA GCACCGGTCC CAGCGATCTT
GAGGACATGC TCAACACCGC GCCCCTGTCG GACACCGTGC GGGGGAAGGC GATTATAGAC
GCGCTGATCG AAGCCGATCT GAGCAAGTAC AACACCCACG ATCCGACCCT GCCCGCTCCG
GCGGGGCCCT ATGTCCTCGT GATCGACCAA CTGCGCGGCG ATGCATCGAT TGCCGGCGCG
GGCGCGGATG CCGCGCGTTT TACCGAGATG CTGGAGGCCG CACGCGCAGA CCACCCGGGC
AAGCAGATCA TCGTGAAGGG TCACCCGGCA GCGACCGACG CGCGGCCGGG GCATTTTGGC
TCCGCATTTT CGGACCCGGT CTCGCCCTGG ACCCTGATCC GGGGCGCCGA TGCGGTCTAT
ACGGTCAGCT CGCAGATGGG GTTCGAGACG ATCCTATCCG GCAAGCGGCC GCAGATCTTC
GGCACACCTT TCTATGCGGG CTGGGGTCTC AGCGACGATC GCGTCCCGGT TCCCCGGCGG
AAGCGAACTC TCGATCGCGC GCAACTTGCA CTGGCTGTGC TCGGCCTCTA CCCGATCTGG
TATGACCCCA GCCGCGACCG ACTCTGCACG GTCGAAGAGG TGATGGCGGC CCTTGCAGCC
CGCGCCCGCG CCTGGCGGCA GGACCGGGCC GGATATGTCG CCCAGGGCAT GCGCCTGTGG
AAACGCGCAT CCATGACCGC CTTCCACGGG GACGGACCTG TGCGCTATGT CTCTGGCGAA
CAAGAGGCAC TGGACCTTGC CGACCGCAGC GGTCGCCCGA TCCTGAGCTG GGCCAGCCGC
ACGCCACCGG CCATTGTCGA TGCGGCGCGC CAGCGCGATA TGCCTCTGCT GCGGGTCGAG
GATGGCTTCC TGCGCTCCCG CGGACTGGGC GCGAACCTCG TGCCGCCGCT GTCCCTCGTC
CATGACCGGC GCGGCATTTA CTATGACCCG ACTGTCCCCA GCGATCTGGA GCATCTGATT
GCGCGCCGTG CCGCAATCGG CAGCACTGCC CGTGGGCGCG TCCTGCTGCA CCGGCTGCGG
GCCTCCGGAC TGAGCAAATA CAATCTCGAC CTGCCCGGCT ACAGCCCGCC TGACGCGCAC
CGCCGCTGCA TCCTGGTCAT CGGACAGGTC CGCGATGATG CCTCCGTGTT GCTCGGACAG
AGTGGGGTTA TCCCCGACAT CGAACATTTG CTCTTGGCCG CCCGAACCGC CAATCCCGAC
GCCCATATCG TCTACAAACC CCACCCGGAC GTGGAGGCTG GCCTACGAGA CGGGTCGATC
GAGAGCGAAC AGGCCAATGA GATCGCGCGG CGGGTCGACC CGGTATCGGT ACTGGCAGCC
GCGGCGGAGG TCTGGACCCT GTCATCGCTG ATGGGGTTCG AGGCACTTCT TCGGGGCAAG
CCGGTGGTCT GCGCGGGGGT TCCTTTCTAT GCGGGCTGGG GTCTAACCCG AGACCTCGCC
TCTGCCGATC ACCCGGCCTT CGCGCGGCGA ACGGCTCGAC CGGATCTGGC CGCACTGGTC
GAAGCATGTC TGATCGACTA CCCGCGCTAT CACGATCCAG TCTCGAACCT GCCCTGCCCT
GTCGAAACCG TACTCGATCG GCTGGAACAG GAAACCGAAG CAGCGCATAA GCCGTCCCTG
CGTTTGCTTG CCAAGCTGCA GGGACTTCTC GCGTCGCAAT CGTGGATCTG GCGTTAG
 
Protein sequence
MCVSSLSHLR DRDLRRILIL AGSRLTMARP GTTRPLALWG NGGKSARRGT ALARCLGAPC 
VYLEDSFLRS VLPGRAGAPV HGLTFDRQRP FYDSTGPSDL EDMLNTAPLS DTVRGKAIID
ALIEADLSKY NTHDPTLPAP AGPYVLVIDQ LRGDASIAGA GADAARFTEM LEAARADHPG
KQIIVKGHPA ATDARPGHFG SAFSDPVSPW TLIRGADAVY TVSSQMGFET ILSGKRPQIF
GTPFYAGWGL SDDRVPVPRR KRTLDRAQLA LAVLGLYPIW YDPSRDRLCT VEEVMAALAA
RARAWRQDRA GYVAQGMRLW KRASMTAFHG DGPVRYVSGE QEALDLADRS GRPILSWASR
TPPAIVDAAR QRDMPLLRVE DGFLRSRGLG ANLVPPLSLV HDRRGIYYDP TVPSDLEHLI
ARRAAIGSTA RGRVLLHRLR ASGLSKYNLD LPGYSPPDAH RRCILVIGQV RDDASVLLGQ
SGVIPDIEHL LLAARTANPD AHIVYKPHPD VEAGLRDGSI ESEQANEIAR RVDPVSVLAA
AAEVWTLSSL MGFEALLRGK PVVCAGVPFY AGWGLTRDLA SADHPAFARR TARPDLAALV
EACLIDYPRY HDPVSNLPCP VETVLDRLEQ ETEAAHKPSL RLLAKLQGLL ASQSWIWR