Gene M446_6030 details


Gene Information

Locus tag: M446_6030
Symbol:
ID: 6129808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6617127
End bp: 6618143
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 70%
IMG OID: 641646129
Product: TRAP dicarboxylate transporter- DctP subunit
Protein accession: YP_001772741
Protein GI: 170744086
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
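
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 6617127
end_bp = 6618143

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1017 0 338
```

Under these assumptions the computed values agree with the listed 1017 bp and 338 aa.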


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.122979
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.224939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTCA CGCGTCGTTC GCTCGTGGCC GCCGGTCTGG CGGCGCCCGC GCTCCTGTCC 
GCCCGGACGG CCCGGGCGGC CACCACCCTC AAGCTGTCGC ACCAGTTCCC GGGCGGCTCG
ATCGACGAGG GCGACTTCCG CGACCGCATG GCGCGCAAAT TCGCGGTCGC CCTCAAGGAG
CGCTCGAAGG GCGCCCTCGA CCTGCAGGTC TATCCGGGCT CGTCCCTGAT GAAGACCAAC
GCCCAGTTCA GCGCGATGCG GAAGGGCGCG CTCGACATGA GCCTCTACCC GCTCCCCTAC
GCGGGCGGCG AGGTGCCGGA GACCAATATC GGGCTGATGC CGGCGCTGGT GACCTCCTAC
GAGCAGGCCA AGGCCTGGAA GGCCGCCCCG GTCGGCCGCA AGCTCGCCGA GATCCTGCAG
GCCAAGGGCA TCGTGATGGT GTCCTGGGTC TGGCAATCGG GGGGCGTGGC GAGCCGCGAG
CGCCCGCTGG TGACGCCGGA CGACGCCAAG GGCATGAAGG TGCGCGGCGG CTCCCGCGAG
ATGGACCTGA TGATGAAGGC GGCGGGCGCC GCCACCCTCA GCCTGCCCTC CAACGAATCC
TACGCGGCGA TGCAGACCGG CGCCTGCGAC GCCGTCATCA CCTCCTCGAC CAGCCTGATC
TCCTTCCGCC TGGAGGAGCT CGCCAAGGCG CTGACCTCCG GCAAGGGCCG CTCCTACTGG
TTCATGCTGG AGCCGATCAT GATGTCGAAG ATCGTCTTCG ACGGCCTGCC GAAGGACCAG
AAGGACCTGA TCATGCAGGT CGGGGCCGAG CTGGAGTCCT TCGGGCAGCA GGGCGCGATG
GCGGACGACG ACCGGGTCGA GCAGGTCTTC GGGAAGGCCG GCGCCAAGAT CGCGACCCTC
GACGAGGCGA CCCTCGGCCG CTGGCGCGAC ATCGCCCGCG ACACCGCCTG GAAGGATTAC
GCGGCGAAAT CGTCGAGCTG CGCCGAGATG CTCAAGCTCG CGGAGGCAGT CGCGTGA
 
Protein sequence
MALTRRSLVA AGLAAPALLS ARTARAATTL KLSHQFPGGS IDEGDFRDRM ARKFAVALKE 
RSKGALDLQV YPGSSLMKTN AQFSAMRKGA LDMSLYPLPY AGGEVPETNI GLMPALVTSY
EQAKAWKAAP VGRKLAEILQ AKGIVMVSWV WQSGGVASRE RPLVTPDDAK GMKVRGGSRE
MDLMMKAAGA ATLSLPSNES YAAMQTGACD AVITSSTSLI SFRLEELAKA LTSGKGRSYW
FMLEPIMMSK IVFDGLPKDQ KDLIMQVGAE LESFGQQGAM ADDDRVEQVF GKAGAKIATL
DEATLGRWRD IARDTAWKDY AAKSSSCAEM LKLAEAVA
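
The protein sequence corresponds to a codon-by-codon translation of the gene sequence, and the GC content is the fraction of G and C bases. The following is a minimal Python sketch of both calculations, not part of the original record. It assumes the gene sequence is given on the coding strand and uses the standard codon assignments (NCBI translation table 11 assigns codons to amino acids in the same way as the standard code and differs in the start codons it permits). The function names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCTCTCA CGCGTCGTTC GCTCGTGGCC GCCGGTCTGG CGGCGCCCGC GCTCCTGTCC"
print(translate(first_line))  # MALTRRSLVAAGLAAPALLS
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared with the listed protein sequence and GC content.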