Gene M446_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2000 
Symbol 
ID6129203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2236267 
End bp2237523 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content72% 
IMG OID641642231 
Producthypothetical protein 
Protein accessionYP_001768899 
Protein GI170740244 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0807] GTP cyclohydrolase II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.307496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.160134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGT CCAACCGCTC GACGCATATC CGGCTCACCT CGCATCCGGA GCCGGGCGCC 
GCCCGCTGGC CGATCCGCTG GGGCGCCGCC GATCCGCGCG AGCGCGGCCC CGTGATCGGC
ACCGTCACCA ACCCGGCCGA CCGCAACGTG ATCGGGGCGA ATGGCGGGGC CTATTCCCTC
TACCGGGCGC TCGCCATCGC CGGGCGCGCC CTCAACCCGC TGGCCCGGCC CGACCTGACG
AACACCCATC CGGTCGTGCC GATCGGCCCG CACCCGCAAT GGAGCGAGCC GGACCGCATC
GTCTCCCTCG ACCCCTGGGG CCACCTGCCG GGCGAGGTCT TCGCCCGCGA GATCGCCACC
GGCGCCGACA TCCGCCCGAC CATCGCGATC ACCAAGGCGC GGCTCTCGCT GCCGGAGATC
CTGGCGGCGA TCGGCGCCCA CCGCCTCGCG CCGGACGGCG CGATCCTGCA TCCGGGCGGC
GACATCTCGG TGGTGAAGAT CGCGGTCGAT CCGGTCTGGC ACCTGCCGGG CGTCGCGGCG
CGGTTCGGGA CCAGCGAGAC GGCCCTGCGC CGCACGCTCT TCGAGCAGAC CGGCGGCATG
TTCCCGGAAC TGGTGACGCG GCCCGACATG CAGGTCTTCC TGCCGCCGAT CGGCGGCTGC
ACGATCTACA TCATGGGCGA CGCGGCGCGC ATCGCCGAGC CGCGCACCCG CATCGCCTGC
CGGGTCCACG ACGAGTGCAA CGGCTCGGAC GTGTTCGGCT CGGACATCTG CACCTGCCGG
CCCTATCTCA CCCACGGCAT CGAGGAATGC GTGCGGGAGG CGCAGGCGGG CGGCGTCGGC
GTCATCGTGT ACAATCGCAA GGAGGGGCGG GCCCTCGGCG AGGTGACGAA GTTCCTGGTC
TACAACGCCC GCAAGCGCCA GGAGGGCGGC GATTCCGCGG CCACCTACTT CGAGCGCACG
GAATGCGTGG CCGGCGTGCA GGATGCGCGC TTCCAGCAGC TGATGCCCGA CGTGCTGCAC
TGGCTCGGCA TCCGCCGCAT CGACCGGCTG ATGTCGATGT CGAACATGAA GTACGACGCG
ATCACCGGTT CGGGGATCGC GGTCGGCGAG CGGGTGCCGA TCCCGCCGGA ACTGATCCCC
CCGGACGCGG CGGTGGAGAT CGAGGCCAAG AAGGCGGCCG GCTACTACAC GCCCGGCGAG
GCGCCCGATG CGGGGGCGCT CGACGCCGTG AAGGGGCGCG ACCTGGAGCG GTTCTGA
 
Protein sequence
MTTSNRSTHI RLTSHPEPGA ARWPIRWGAA DPRERGPVIG TVTNPADRNV IGANGGAYSL 
YRALAIAGRA LNPLARPDLT NTHPVVPIGP HPQWSEPDRI VSLDPWGHLP GEVFAREIAT
GADIRPTIAI TKARLSLPEI LAAIGAHRLA PDGAILHPGG DISVVKIAVD PVWHLPGVAA
RFGTSETALR RTLFEQTGGM FPELVTRPDM QVFLPPIGGC TIYIMGDAAR IAEPRTRIAC
RVHDECNGSD VFGSDICTCR PYLTHGIEEC VREAQAGGVG VIVYNRKEGR ALGEVTKFLV
YNARKRQEGG DSAATYFERT ECVAGVQDAR FQQLMPDVLH WLGIRRIDRL MSMSNMKYDA
ITGSGIAVGE RVPIPPELIP PDAAVEIEAK KAAGYYTPGE APDAGALDAV KGRDLERF