Gene M446_5228 details


Gene Information

Locus tag: M446_5228
Symbol:
ID: 6131142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5749089
End bp: 5750129
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 67%
IMG OID: 641645362
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_001771986
Protein GI: 170743331
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
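
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions about this record's conventions, and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(5749089, 5750129)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Under these assumptions the computed values agree with the listed Gene Length (1041 bp) and Protein Length (346 aa).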


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.379056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0433147
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGCA CCGACGGCCG CCTGCGGCCC CTCTCCCGTC GCGCGCTCTG CGCCGCGGCC 
GCGCTCGCGC TGCTCGGGGG CCAAACCCTG CTCGGGGGGC AAGCCCTCGC GCAGGGCCCG
ATCGTCCTCA AGTTCAGCCA CGTCACCGCT CCCGACACGC CCAAGGGTCG CGGCGCCGAC
AAGTTCAAGG AACTGGCCGA GAAATATACC GGCGGAAAGG TCAAGGTCGA AGTCTACCCG
AACTCGCAGC TGTTCAAGGA CAAGGAGGAG GTCGAGGCGC TGCAGCTCGG CGCCGTGCAG
ATGCTGGCGC CCTCGCTGGC GAAGTTCGGG CCGCTCGGCG CGAAGGAGTT CGAGGTCTTC
GACCTGCCCT ACATCCTGCC TGACAAGGCG GCCCTGCGGC GCGTGACCGA GGGGCCGCTC
GGCCGGCGCC TGTTCGAGAA GCTCGAGACC AAGGGCATCA CCGGCCTCGC CTACTGGGAC
AACGGCTTCA AGATCATGAG CGCCAACAAG CCGCTGCGGC TCCCCGCGGA TTTCCGCGGC
CTCAAGATGC GCATCCAGTC CTCCAAGGTG CTGGAGGCGC AGTTCCGCGC GCTCGGGGCG
ATTCCCCAGG TGATGGCCTT CTCGGAAGTC TATCAGGGCC TGCAGACCGG CGTGGTGGAC
GGGTCCGAGA ACACGCCCTC GAACATGTAC ACGCAGAAGC ACCACGAGGT GCAGAAATAC
GCGACCCTCT CCGACCACGG CTATATCGGC TACGCGGTGA TCACGAACAA GAAATTCTGG
GACGGCCTGC CGCCGGAGGT GCGCGGCCAG CTCGAGAAGG CGATGGCGGA GGCGACGGCC
TACGCCAACG AGGTCGCGGG CCGGGACAAT GCCGATGCCC TGGAGGAGAT GCGGAAGTCC
GGCAAGATCA CCTTCCTGAC CCTGACCGAC GAGGAGAAGG CGGCCTGGCG CAAGGCCCTC
GAGCCGGTCA CGGCCGAGAT GACCAAGCGC GTCGGCAAGG ACGTGATCGA GGAGTTCCAG
CGCGAGGCGC GCACGCAGTA G
 
Protein sequence
MSRTDGRLRP LSRRALCAAA ALALLGGQTL LGGQALAQGP IVLKFSHVTA PDTPKGRGAD 
KFKELAEKYT GGKVKVEVYP NSQLFKDKEE VEALQLGAVQ MLAPSLAKFG PLGAKEFEVF
DLPYILPDKA ALRRVTEGPL GRRLFEKLET KGITGLAYWD NGFKIMSANK PLRLPADFRG
LKMRIQSSKV LEAQFRALGA IPQVMAFSEV YQGLQTGVVD GSENTPSNMY TQKHHEVQKY
ATLSDHGYIG YAVITNKKFW DGLPPEVRGQ LEKAMAEATA YANEVAGRDN ADALEEMRKS
GKITFLTLTD EEKAAWRKAL EPVTAEMTKR VGKDVIEEFQ REARTQ
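
The gene and protein sequences above can be cross-checked against each other. The following is a rough sketch, not the database's own method: it strips the 10-base block spacing, computes GC content, and translates codons until the first stop. The helper names are hypothetical. The amino acid assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not model (this gene begins with ATG, so it makes no difference here).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate("ATGTCCCGCA CCGACGGCCG C"))  # MSRTDGR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) is one way to confirm the two blocks are consistent.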