Gene Dshi_2556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2556 
Symbol 
ID5713453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2720713 
End bp2721852 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content67% 
IMG OID641268479 
Producthypothetical protein 
Protein accessionYP_001533890 
Protein GI159045096 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00000358753 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGGAGC TGATCAAGAC CCAGAGCCGT GCGCCCGGCG AGTGCCTGAT CCATATCGGG 
GACGCGGAGA TCGTCGATCT CTACCCGTTT CTGATGGAGG TGACGGTCGA TACGGCGCGG
GAGGCGGCCT CCGAAGCCGT CCTGAAATTC GAGACGCGCC GGGATCTCGA CGGCAGCTGG
ATCGTGCAGG ATGACGACCG GATCCGTCCC TGGAAGCCGC TTCGGATCGA GGCCGCGTTC
GGGGACGAAA CGGAAGAGGT CATGCGCGGC TATATCCGCC AGATCGATGT GTCCTTCCCC
GAGGATACAG GGGGCGCGAC GGTCACGTTG GCGGTGCAGG ATGACAGCCT CGCCCTTGAT
CGCACCGCGC GAACCGAGGC GTGGGGCGCG GAGGGGCGGA CCACGGATCA GGCCATCGTC
ACGGACATCC TGTCGCGCAA CGACCTTGTC CCGGAGGGCA TGCCGGCCCT GGGCCAGACC
GACCTCGCGG TGACCCAGGA CGACACCGAT GCGGCCTTTC TGCGCAAGCG GGCGGAGGTG
AATGGTTTCG AGCTGATCTA CCGCCGGGGC GCGGTCTATT TCGGACCCCG GCGCTTGACG
GCGGCGCCGC AGGCCACGGT GAAGGTCTAT GCCGGGCCGG ACACCACCTG TTTGAGCTTT
GCGGTCACCG ATGACGGGAT GAAGCCCGAC GGGGTGGAGT ATGACGTGGC GAGCGCCGAG
GGCGCCAGGA CCGAGACCCG GCGTCTGGCT CCGAACCTCG ATGCGCTGGG CCCGGAGCCA
GCAAGCTCGG TCGCGGCGCT GGACGACGGG TTCGTCTGGA AGATCCGCAA GGAGGGTGAG
AGCGACGCGG CCAAGGCCGA AACCCTTGCG CAGGAGAAGG CCAACGCCAA TGCCATGAAG
ATCAGCGGCA AGGGTGTTCT GGATGGCGCG CTCTATGGTC ATGTGCTGCT GACCGGGCTG
CCCGTGGGCG TGGACGGGGT GGGCAACCGC CATTCGGGCA TCTGGTACGT GGACCGGGTG
CGCCACGTGT TCGACACCAC GGGCTACCGG CAGGAGTTCG AGTTGCAGCG CAACGCCTAT
GGCGACAACC TGCCCGAAAC AGGCGATCCG CTGGCGCGGC TGCGGGGGGT CGGCACATGA
 
Protein sequence
MLELIKTQSR APGECLIHIG DAEIVDLYPF LMEVTVDTAR EAASEAVLKF ETRRDLDGSW 
IVQDDDRIRP WKPLRIEAAF GDETEEVMRG YIRQIDVSFP EDTGGATVTL AVQDDSLALD
RTARTEAWGA EGRTTDQAIV TDILSRNDLV PEGMPALGQT DLAVTQDDTD AAFLRKRAEV
NGFELIYRRG AVYFGPRRLT AAPQATVKVY AGPDTTCLSF AVTDDGMKPD GVEYDVASAE
GARTETRRLA PNLDALGPEP ASSVAALDDG FVWKIRKEGE SDAAKAETLA QEKANANAMK
ISGKGVLDGA LYGHVLLTGL PVGVDGVGNR HSGIWYVDRV RHVFDTTGYR QEFELQRNAY
GDNLPETGDP LARLRGVGT