Gene Dshi_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1041 
Symbol 
ID5711009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1075005 
End bp1076324 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content63% 
IMG OID641266952 
ProductTRAP dicarboxylate transporter 
Protein accessionYP_001532384 
Protein GI159043590 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.128417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATGG AATGGTGGGA AGCGGCCCTT CTGATGCTCG GGATGGTGAT CGGGCTGATG 
GCGCTTTGTG TGCCGGTGGC CTTTGCCTTC CTCATCGCGA ACCTGATCGG GGCCTATATC
TTCATGGGCG GGCTGATCGG GGTCGAGCAG CTGGTCGCCA ATACCGGGGA GGCTGTGTCG
AGCTTCGTGC TGGTCACGGT GCCGATGTTC GTGCTGATGG GGAACCTGTT TTTCCATTCC
GGCATCGCGC TGAAGATCAT CGAGACGCTC GACCGATCCA TGGGCCGCTC CACCGGGCGG
CTCAGCTACA TATCCGTTCT CTGCGGAACA ATCTTCGCCG CCCTGTCGGG GTCCAACATG
GCCAACACGG CGATGATGGG GGGGCTGCTG CTGCCGCAGA TGGAAGAACG GAAGTACCAG
CGCCATATGT CCATCGGCCC GATCATCGGC TCGGGGGGAC TGGCACTGCT GATCCCGCCC
TCGACACTCG CGGTGCTCTT GGGCTCCATC GCGCAGATCA GCATCGCGGA CCTGCTGCTG
GCGGGTGTGC TGCCGGGGCT GGTGCTCGCG CTGCTCTACG TGGCGACGAT CTGGCTGCAA
CTGCGGCGCA ACCCTCAGGC CGCGCCCGCC TATGACGTGG TGACCGCGCC GTTCTGGGAA
AAGATCCGCC TGATCTGCAC CTATATCCTG CCCATGTCTC TGGTGGTGTT CTGTGTTGTC
GGGTTGATCC TGCTGGGCAT CACCACCCCG TCGGAGGCCG CGGCCTTCGG CGTGCTGTCG
GTGCTTGTGC TGTCGATCCT TTATGGCCGG TTCTCATGGG ACATGGTCGC GAAATCCCTC
GAAGGCACCC TGCGCGTGTC GGTCATGGTG TTCTTCATCA TCATCGCCTC GAAAACCTTC
AGCCAGGTGC TGGCGTTTTC CGGGGCGACC TCGGGGATGA TCGCATGGGC GACCTCCTAC
GAATTCGCGC CGATCACCAT GCTGTTGATC ATGTTCGTCG TGCTGCTGAT CCTGGGGATG
TTCGTCGATG CGATCTCGAT GATGCTGCTG ACGATCCCGA TCTTCTTTCC CATCGCCGCC
GCCATGGGGT TCGATCCGAT CTGGTTCGGC CTCGTGATGC TGCTGGCCAT CGAGATGAGC
GGAACGACAC CGCCCTTTGG CCTGTTGTTG TTCGTGATGC AGGGGGTGGC GCCGCCGGGC
ACGACCTATT GGACCATCGT GCGGGCGGCC GCGCCTTACC TGATCTGTGA CCTGATCCTG
CTGGTCGGGC TGATCGCGGT GCCTGCGCTG GCGCTGTGGC TGCCGGGGCT GAGGTTCTGA
 
Protein sequence
MLMEWWEAAL LMLGMVIGLM ALCVPVAFAF LIANLIGAYI FMGGLIGVEQ LVANTGEAVS 
SFVLVTVPMF VLMGNLFFHS GIALKIIETL DRSMGRSTGR LSYISVLCGT IFAALSGSNM
ANTAMMGGLL LPQMEERKYQ RHMSIGPIIG SGGLALLIPP STLAVLLGSI AQISIADLLL
AGVLPGLVLA LLYVATIWLQ LRRNPQAAPA YDVVTAPFWE KIRLICTYIL PMSLVVFCVV
GLILLGITTP SEAAAFGVLS VLVLSILYGR FSWDMVAKSL EGTLRVSVMV FFIIIASKTF
SQVLAFSGAT SGMIAWATSY EFAPITMLLI MFVVLLILGM FVDAISMMLL TIPIFFPIAA
AMGFDPIWFG LVMLLAIEMS GTTPPFGLLL FVMQGVAPPG TTYWTIVRAA APYLICDLIL
LVGLIAVPAL ALWLPGLRF