Gene Smed_3743 details


Gene Information

Locus tag: Smed_3743
Symbol:
ID: 5318733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 186893
End bp: 187885
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 63%
IMG OID: 640775556
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_001312489
Protein GI: 150375893
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
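
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA).

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (assumptions, not stated by the record).
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons = gene_length_bp // 3          # total codons, stop included
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0       # a CDS should be a whole number of codons
        and codons - 1 == protein_length_aa
    )


# Values taken from the record above.
print(check_cds_lengths(186893, 187885, 993, 330))  # True
```

Here 187885 - 186893 + 1 = 993 bp, and 993 / 3 = 331 codons, of which one is the stop codon, leaving 330 amino acids.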


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.397744
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAACT TTCTGAAACA TGGCCGGATA ACCGGCGCCA TGCTGGGGGC CGCTTTATTG 
ACAGGCGCCG CCTCGGCAAC CGAATTGCGC TATGCGCATG TGGGCGCCGA GGGAGACATC
CAGACGGTTT ACGCCGCACA GGCGGCGGAG GGAATTGCGG CGGCGACCGG CGGCGAAGTC
ACCGTCACCG TCTATCCCGC CAGCCAGCTC GGCGGGGTCG CGGAAATGGT GGATGGCGTG
CGCATGGGCT CGATCTCCAT GGGCCATCAT GATTTTGCTT CCCTCGCCCG ACTGGTCCCG
GAGGTCGCCG TCTTCAACGC GCCTTTCATC TATCGCGACG GCGCACATGC GCTCGCTGCA
ACGGACCCGC AGACATCGCC GGCACTCCAG GCGATCAACG AGAAGCTGGT CGCACAGGGC
GTGCGGATCA TCGGGCGCAT CTATCGCGGC GATCGCCACA TTTCCTCAAA TTTTCCGGTG
AAGACTCCCG CGGACCTTGC CGGAAAGCCC TTCCGTGCCG TCCCGCTCGA ATTGTGGGTT
TCCATGGTCA AGGGCTTCGG CGCAATTCCT ACCCCGGTCG AGGTTGCCGA ACTCCCGACC
GCGCTGATGA CGGGCGTGGT GGTCGGTCAG GAAAACCCGC TGACCATGAT CGCCTCCAAC
AATCTCAACG AGGTGCAATC GCATCTGTCA ATGACCGGCC ACATGCGCGC CGTGCTCGCC
GTCTTCATCA ATGAGGACGT CTGGCAGGGA TTGAGTGAAG AGCAGCGCTC GGCCCTCACC
AAGGTCCTCG ACGAGGAGGC CCGGAAATCG CTGAAGATGG CAACGGAATC AGAGGCCGAT
CTGGTGAAGG AACTCAAGGG CCGCGGCATG ACCGTCATAA CGGAGGCCGA AGGGCTCGAC
GTGGCGGCGT TCCGTGAGAA GGTCAGCGCC CAGATCAGAC AGGACTTCCC CGATTTCGCG
CCGCTCATCG AGCAGATCGA GGCGGTGAAG TAA
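
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in Gene Information. A minimal sketch, assuming the sequence is pasted in as plain text; the helper names `clean_sequence` and `gc_content` are hypothetical:

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Return the fraction of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input for illustration; paste the full gene sequence to check the record.
print(clean_sequence("ATGC GGCC"))  # ATGCGGCC
print(gc_content("ATGC GGCC"))      # 0.75
```

Applied to the full sequence above, `len(clean_sequence(...))` can be compared with the reported gene length and `gc_content(...)` with the reported GC content.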
 
Protein sequence
MLNFLKHGRI TGAMLGAALL TGAASATELR YAHVGAEGDI QTVYAAQAAE GIAAATGGEV 
TVTVYPASQL GGVAEMVDGV RMGSISMGHH DFASLARLVP EVAVFNAPFI YRDGAHALAA
TDPQTSPALQ AINEKLVAQG VRIIGRIYRG DRHISSNFPV KTPADLAGKP FRAVPLELWV
SMVKGFGAIP TPVEVAELPT ALMTGVVVGQ ENPLTMIASN NLNEVQSHLS MTGHMRAVLA
VFINEDVWQG LSEEQRSALT KVLDEEARKS LKMATESEAD LVKELKGRGM TVITEAEGLD
VAAFREKVSA QIRQDFPDFA PLIEQIEAVK
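
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate a block-formatted DNA sequence, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTGAACT TTCTGAAACA TGGCCGGATA"))  # MLNFLKHGRI
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GCG GTG AAG TAA) translate to AVK followed by a stop, matching its last three residues.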