Gene Dshi_1110 details


Gene Information

Locus tag: Dshi_1110
Symbol: tolA
ID: 5711078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1134607
End bp: 1135662
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 72%
IMG OID: 641267021
Product: hypothetical protein
Protein accession: YP_001532453
Protein GI: 159043659
COG category: [S] Function unknown
COG ID: [COG5373] Predicted membrane protein
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
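
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 1134607
end_bp = 1135662

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1056 0 351
```

Under these assumptions the result agrees with the listed gene length of 1056 bp and protein length of 351 aa.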


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00592448
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.281955
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCGCC TGCCGCATAT CGAGACCGGG ACCTATATCT CCGGCGCGGG GCATCTGGCG 
CTCTTGGCCT GGCTCGCCCT GGGCGGACTT TTTTACTCCG CGCCCGAATT GCCGGTGCCT
TCGGCGGCTG ATGTCGTGTT GTTCAGCGAG GCGGAGTTTG CCGCGATGAC CCGGGCGCCG
GAGGTTGCCG AGCCCGCGCC CGCGCCACCC CCCGTGCCCG CGCCGGCCCC AGAACCCGCC
CCACCCCCCG AACCTGCGCC AGCGCCGGAA CCCGTCCCGC AGCCGGAACC CGAACCCGTG
CCGATCCCCG AGCCGGAGCC CGCGCCGCCG CCCCCGGCAG AGCGCGTGGC ACCCGAGCCC
GTGCCGCAGC CCGAGCCGGA GGCGCAGGTC GCGCCAGAAC GGCAGGAGGC CGTCACCCCC
GACAATTCCG GGGCCGAGGC CGTGCCTGCG GAAGAGGCCA CGGCCCCCGA GGCCGCGACC
ACACGCATTA TCACCGAGGC GACCGAGACC GATCCCGAAA GCCAGGCCCC GGACTTGATC
GCCAGCCCGC GCCCCTCGGC GCGTCCCGAC CGCCCCCGGC CCGTGCCGGT GGAGGCCCCG
CCCACACCCG AAGCGCCGCC GGAAACGCCC GCCGAGACTG CGCCGGACCC GGTCGCCGAT
GCGGTCGCGG CTGCCGTGGC AGAGGCCGCG GAGACCCCGA GCGCGGTGAG CCGCCCGGAT
GTGCCCGTCG GTCCGCCACT CACCGCGTCC GAACGGGACG GTCTGCGCGT CGCTGTTCAG
CAATGCTGGA ACGTGGGTTC GTTGTCGTCG GATGCGCTGC GCACCACGGT CACGGTCGCA
GTGGAGATGG AGCAATCGGG CCGCCCCGTG ATCAATTCCA TCCGCATGAT CGGCTCCGAA
GGCGGCTCGG ACGCGGCGGC GCGCCAGGCC TTCGAGACGG CACGCCGGGC AATCATCCGC
TGCGGCAGCG CCGGTTTCGA TCTGCCGGTT GAGAAATATG CCCAGTGGCG CGATATCGAG
ATGACATTTA ACCCCGAAAG GATGCGGATC CGATGA
 
Protein sequence
MVRLPHIETG TYISGAGHLA LLAWLALGGL FYSAPELPVP SAADVVLFSE AEFAAMTRAP 
EVAEPAPAPP PVPAPAPEPA PPPEPAPAPE PVPQPEPEPV PIPEPEPAPP PPAERVAPEP
VPQPEPEAQV APERQEAVTP DNSGAEAVPA EEATAPEAAT TRIITEATET DPESQAPDLI
ASPRPSARPD RPRPVPVEAP PTPEAPPETP AETAPDPVAD AVAAAVAEAA ETPSAVSRPD
VPVGPPLTAS ERDGLRVAVQ QCWNVGSLSS DALRTTVTVA VEMEQSGRPV INSIRMIGSE
GGSDAAARQA FETARRAIIR CGSAGFDLPV EKYAQWRDIE MTFNPERMRI R