Gene Rmet_4720 details

Gene Information

Locus tag: Rmet_4720
Symbol:
ID: 4041581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1362535
End bp: 1363635
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 63%
IMG OID: 637980141
Product: hypothetical protein
Protein accession: YP_586851
Protein GI: 94313642
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID:
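
The coordinate and length fields above are consistent with one another, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the gene length counts the terminal stop codon while the protein length does not. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1362535, 1363635)
print(length, protein_length(length))  # 1101 366
```

Under these assumptions the values reproduce the listed Gene Length (1101 bp) and Protein Length (366 aa).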


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACAA CGATCACGAC GGTCGGGCTC AAGCGCCTCG CCGTTGCCAC GCTATTCGCC 
TGCGGGATAG TCCATGCGCA ATCGACGGTG CCGGTGATGC AACCCACCGA AGGCGATTTC
GAAGCACGCG ACTTCCACTT CCAGAGCGGC CAGACGCTGC CGACCGTCAA GCTTCACTAC
GCGACACTGG GCACGCCCAC TCGCGGCGCC GACGGCAAGG TCAACAACGC CGTGCTTCTG
CTCCACGGCA CTACCGGCAC CGGACGCGCG TACCTGACGC CGCTGATGCA GAAGGAACTG
TTCGCGGCGG GACAGCCGCT CGACGCCTCG CGTTACTACA TCATCATGCC CGACGGCATC
GGCCGTGGCG GATCGAGCAA GCCCAGCGAC GCCCTGCGCG CGAACTTTCC GCGCTATGGC
TACAACGATG TGGTGGAAGG CCACTACCGC CTGCTGACCG AGGGACTCAA AGTCGATCAC
CTGCGGTTGA TACTGGGCAC GTCGATGGGC GGCATGCAGA CGTGGGTCTG GGGTGAACGG
CATCCGGACA TGATGGATGC GCTGATGCCG ATCGCCAGCC AGCCCGTGGC GATGTCGGGC
CGCAACTGGT TGTGGCGCCG GATGCTGATC GACGCGATCC GGAATGACCC GGACTGGAAC
GGCGGCAACT ACACGCGGCA GCCCACGCAC TGGACCCGCA CCACGCCGGT ATTCGCCCTG
ATGACGCAAA GCGCGGCCAC GTTGCAGAAG GCCGCTCCTA CGCGCGACCA GGTCAACCAG
TACGTCGACA AGACCGTGGC GGACAGCCGC GGCGTGGACG CCAATGACTA CCTGTACTGG
TTCGAATCAT CATGGGACTA CAACCCCGAG CCGGATCTGG GCATGATCCG CGCGCCGCTT
TACGCGGTGA ACTTCGCCGA CGACATGATC AACGCGGTGG ACCTCGGCGT CATGCAACGC
ACCGTGCCGA AGGTACGGCA AGGCAAGTAC GTGGAGATGC CGGAGAGCGT GAACACATAT
GGCCATCAGA CGTTGCAACA CCCCGAGGTC TGGAAGCCGT ATCTCGTTGA ACTGCTGAAG
TCGCTACCCG CGCAAAAGTA G
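
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be normalised before any computation. The following is a minimal sketch of that step together with a GC fraction calculation. The helper names are hypothetical, and only the first row of the sequence is used as example input. Running it on the full sequence gives a value that can be compared with the GC content listed under Gene Information.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, used as example input.
first_row = "ATGAAGACAA CGATCACGAC GGTCGGGCTC AAGCGCCTCG CCGTTGCCAC GCTATTCGCC"
seq = normalize_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```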
 
Protein sequence
MKTTITTVGL KRLAVATLFA CGIVHAQSTV PVMQPTEGDF EARDFHFQSG QTLPTVKLHY 
ATLGTPTRGA DGKVNNAVLL LHGTTGTGRA YLTPLMQKEL FAAGQPLDAS RYYIIMPDGI
GRGGSSKPSD ALRANFPRYG YNDVVEGHYR LLTEGLKVDH LRLILGTSMG GMQTWVWGER
HPDMMDALMP IASQPVAMSG RNWLWRRMLI DAIRNDPDWN GGNYTRQPTH WTRTTPVFAL
MTQSAATLQK AAPTRDQVNQ YVDKTVADSR GVDANDYLYW FESSWDYNPE PDLGMIRAPL
YAVNFADDMI NAVDLGVMQR TVPKVRQGKY VEMPESVNTY GHQTLQHPEV WKPYLVELLK
SLPAQK
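
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a small pure-Python translator. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. Alternative start codons are not handled, which is an assumption that suffices here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; translation
# table 11 uses the same assignments and differs only in start codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAGACAA CGATCACGAC GGTCGGGCTC"))  # MKTTITTVGL
```

The output matches the first ten residues of the protein sequence. The final row of the gene sequence likewise translates to SLPAQK followed by a stop codon.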