Gene Dshi_3647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3647 
Symbol 
ID5714177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009955 
Strand
Start bp46205 
End bp47584 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content66% 
IMG OID641276565 
Productconjugation TrbI family protein 
Protein accessionYP_001541861 
Protein GI159046189 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2948] Type IV secretory pathway, VirB10 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0441061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA CCGAGAATAC CGAGCTGGAA AAGCGCCTCG CCGCCCTGGA GAAAGGCAGT 
GCCCGGGCCC CCATAGCGGC GCAGCGCCGG TCGCCCCTTC TCGCGCTGAT CGTGATCCTC
GTCATCGGCG CGGGTGGTGC CCTGCTTTAT CTCCTCTCAC AGCCCGACGA AGAGGAAGCC
TTGCCGACGG CCACCCCGGA CGTCTTCCAA AACGAGGGGG ACGGCTTTGG CGCCATCGAG
ACCTTGCCCC CGCCCGAGCC CGAAGTGGTG TTCGTCGCAC CCGATCCGGT CGAGCCCAAT
GTCGAGCTTC TGGCGCAGAT CACCGCCCTG CAGGCCCAGA TCGAAGAATT GCGCAACGCC
CCCGAGGCGG TCGTCGAGGA AGACACCGCC GCCGCAGAGG CGATCGACGC GCTGACCGCT
CAGATTGCGG CGCTACAAGC CGCCTCGGAA GCAGCACAGC AACGATTTCA GGACGAACTG
ACGGCGCGGG ATCGCAGCCT TGAGCAACTC CGCATGGATC TGGAATTGGC CCAACTCGAG
GCCAGCCGAC CCCAACCCGC GCCAACGGGC CCCACGGAAG ATGAGCTGCG CGCGCGCGAG
CAAGAGCGAC TGCGCCGCGA GGAAGAAGCC CGGCGCATGG CCGAGCTGGA GCGCCGCGCC
GCGGAGGAAC GCGCCTTCCA GGAGCGGCGC ATCACCTCGC CCACCATCGC GTTTGGCGGT
GCCTCCGGAG CGAATGAAAC GGCTCTGACC GAACGCACTT TTGGCGAGGT GACGGATTTC
GTGCTGAACG GGGCGCTGCC CTCGACGGTG ACGCAGGCCG AGGTGATCGC CAATCCTTCC
AACACCATCA TCCAGGGCAC CATGATCCAA GCCGTCATGG AAACTGCCCT TGACAGCTCC
CTGCCCGGCC AGACCCGTGC CGTGGTGTCC GAAGATGTCT ACAGCGTCGA TGGCGTGCGC
CTCCTGATCC CGCGCGGATC GCGTCTCATT GGGCGTTACC GCGCTGGCGT CGATATCGCG
CAGCGCCGCG TCACGATCGC CTGGGACCGG ATCATCCTGC CCGCGGGCCA GACCGTCCAG
ATCAGCTCCT TCGGAGGTGA TGAACTGGGC CGTTCTGGCG TCACCGGCCT CATAGACACA
CGCTTCGCCG AGCGTTTCGG GTCGGCCGCC CTGATCTCGC TTATTTCGGC AGCACCTGGT
GCCGCCGCCT CCGAGGTCCA GGATGAGACT GCCGCCGACG CTCTCGAAGA CGTTGGCGAT
GATCTGGCAG ATGCGACCGA CAGCGTCATA GGCGATTACC TCTCCATCGG CCCCGTCATC
TATGTCGACC AGGGCGCGCG CGTCACCGTC ATGGTCGACC GCGATCTGGA GATATTCTGA
 
Protein sequence
MSDTENTELE KRLAALEKGS ARAPIAAQRR SPLLALIVIL VIGAGGALLY LLSQPDEEEA 
LPTATPDVFQ NEGDGFGAIE TLPPPEPEVV FVAPDPVEPN VELLAQITAL QAQIEELRNA
PEAVVEEDTA AAEAIDALTA QIAALQAASE AAQQRFQDEL TARDRSLEQL RMDLELAQLE
ASRPQPAPTG PTEDELRARE QERLRREEEA RRMAELERRA AEERAFQERR ITSPTIAFGG
ASGANETALT ERTFGEVTDF VLNGALPSTV TQAEVIANPS NTIIQGTMIQ AVMETALDSS
LPGQTRAVVS EDVYSVDGVR LLIPRGSRLI GRYRAGVDIA QRRVTIAWDR IILPAGQTVQ
ISSFGGDELG RSGVTGLIDT RFAERFGSAA LISLISAAPG AAASEVQDET AADALEDVGD
DLADATDSVI GDYLSIGPVI YVDQGARVTV MVDRDLEIF