Gene Dshi_1043 details

Gene Information

Locus tag: Dshi_1043
Symbol:
ID: 5711011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1076851
End bp: 1077846
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 68%
IMG OID: 641266954
Product: TRAP dicarboxylate transporter- DctP subunit
Protein accession: YP_001532386
Protein GI: 159043592
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
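
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1076851
end_bp = 1077846
gene_length_bp = 996
protein_length_aa = 331

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_aa = span // 3 - 1

print(span, span % 3, expected_aa)  # 996 0 331
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.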


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000669152
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTGC ATACCCGACT TCTCGCCGCC GGCGCGGCCC TTCTTCTGGC GGCCCCGGCG 
CTGTCCCAGG AGGCCCGGCT GAGCGTGGTC TACTCGCTGC CTGCGACAAA CGACCTGATG
CAAAGCTATT TCGCCTTCGT CGAGGACGTG AACGCCAATG GCGCGGGCAT CCTGCAAATC
GACCTGCGCG GCGGCACCGA GATCCTGCCC CGGAACGAGC AGATGAACGC GGTCTCGCGC
GGGATCATCG ACCTCTATTT CGGGCCGGCA GGCTATTACC AGCGCCAGGT GCCGGAGCTG
ACCCCGCTCG ACGCCGCCGC GGTGCCTGCC GACAAGCTGC GCGCCGCGGG GCTGCACGAC
GCCATCGATG CCGGCACGCG GGAGCGCGCG GGCGTGGCGT TCCTGGGCGC GATGGGGACG
GGATACAATT TCCAGTTCTA CACCATCACC GAGCCCAAGA TCGACGATGA CGGCACGATG
GATTTCTCGG GGCTCAAGAT CCGCGGCGGC GCATCCTATG ACCCGATGTA CCAGGCGCTC
GGCATCGCCC GGGTCGATGT GCCCGCGGGC GATATCTATA CCGCGCTGGA ACGCGGGCTG
GTCGAGGGGA TCGGGTTCAC CACCATCGGG GTCAGCTCCG GCGGATGGCA GGATTTCCTG
CGCTACCGGA TCTTCCCGAC CTGGCGCCAG GGCAACACGA TCATCGCCGC GAACGCCGCG
AAATTCGACG GGCTGACCGA GGAGCAGCGC GCCTACCTGA TGGAGATGAT CCAGAAGCAC
GAGATGCTGG CCTATGACGC CGCCAAGGCG CTGGAGGCGG TGGATACCGC CGCCCTGGCC
GAGGCGGGCG TGCAGGATGT CGTGCTCGAA GGCGCGGGCG CCGCCGAGGT CACCGCCGCC
TTCCAGGACA CGTTCTGGGT CAACGTGGCC GAGACCCTGG GCGAGGACGC GGCCGCCAAG
TACCGCGCCA TCATCGACGC GGCCAACGGC AGCTGA
 
Protein sequence
MTLHTRLLAA GAALLLAAPA LSQEARLSVV YSLPATNDLM QSYFAFVEDV NANGAGILQI 
DLRGGTEILP RNEQMNAVSR GIIDLYFGPA GYYQRQVPEL TPLDAAAVPA DKLRAAGLHD
AIDAGTRERA GVAFLGAMGT GYNFQFYTIT EPKIDDDGTM DFSGLKIRGG ASYDPMYQAL
GIARVDVPAG DIYTALERGL VEGIGFTTIG VSSGGWQDFL RYRIFPTWRQ GNTIIAANAA
KFDGLTEEQR AYLMEMIQKH EMLAYDAAKA LEAVDTAALA EAGVQDVVLE GAGAAEVTAA
FQDTFWVNVA ETLGEDAAAK YRAIIDAANG S
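
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example rather than a reference implementation: it assumes the forward reading frame starting at the first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative. Only the first and last lines of the gene sequence are used, which works here because every full line holds 60 bases, that is, 20 complete codons.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGACACTGC ATACCCGACT TCTCGCCGCC GGCGCGGCCC TTCTTCTGGC GGCCCCGGCG"
last_line = "TACCGCGCCA TCATCGACGC GGCCAACGGC AGCTGA"

print(translate(first_line))  # MTLHTRLLAAGAALLLAAPA
print(translate(last_line))   # YRAIIDAANGS
```

The two outputs match the first 20 and the last 11 residues of the protein sequence listed above.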