Gene Rmet_2920 details

Gene Information

Locus tag: Rmet_2920
Symbol:
ID: 4039748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007973
Strand:
Start bp: 3173288
End bp: 3175201
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 66%
IMG OID: 637978320
Product: para-aminobenzoate synthase, component I
Protein accession: YP_585062
Protein GI: 94311852
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
        [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
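
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(3173288, 3175201)
print(length_bp)                  # 1914
print(protein_length(length_bp))  # 637
```

Both values agree with the Gene Length and Protein Length fields listed in the record.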


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.388176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAACG CAGCGATAAT GCCGACTGCA GCGTTTTCGC ATCCAGTCTT CGTGGTACCC 
ATCCCTTCGC CCCCCTTCGT TCTGCTCGAC GACGCCACAG CCGGCTCTGG CGTGACGGCT
TCGCGCCTCT ATACCGATTT CGTCCGGGAG GACGTGCTGG CGGCCGGTGC CGACGTATCG
GCGCTTGACA CGCTGCTGGC CAGCGGTTGG CGGGACGGGT TGCACGCCAC GTTCTTTGCG
CCCTACGAAT TCGGTGGTGC GATCGTTGGC GCACCCGTGC ACGTTGGCGA TGCGCTGCCG
TTTCACGACG GCGCCCTGCG CGTGCTGTGG TTTCGCACGT TGCGCCGGCT CGATACGGAA
GGTGTGGCGC AGTGGCTAGC CGCAGGCGCA CAACCAGGCC CGACGGGCGC GCTCGACGTG
ACGGCCAGCA CCACGCGGAC GCAATACACC GAAGCGATCG CGCGTATCCA CGATTACATC
GAGGCGGGCG ATACCTATCA GGTTAACTTC ACGCAACGAC TACGCTGTCG CGTGTTTGGC
GATCCGATGG CGTTCTATGC GGCGTTGCGC GCGGCACAGC CGGTGCCGTT CGGCGTGCTT
GCGCATCTGC CCGGTGGCGG TTGGGTGCTG TCGCTATCGC CCGAACTGTT CGTCGAGCAC
GATGGCCACG GACATCTCGT CGCACGTCCA ATGAAGGGCA CCGCGCCGCG CTCAGGGGAT
GCCGAACAGG ATGCGCGCGC CGCGAAACGA CTGGCCACCG ATGCCAAGAA CCGCGCCGAG
AACGTGATGA TCGTCGACCT GCTGCGCAAC GACCTCGGTC GCGTGGCGAT CCCCGGGAGC
GTGGCGGTGC CCGAGCGCTT CGTCGTCGAA CCGTTCGGTC GCGTGTTGCA AATGACGTCG
ACAGTCACCG CGACCGCGCG CCCGGGTACG TCGTTCGGTG CGTTGATGGC CGCGCTGTTC
CCGTGCGGCT CGATTACCGG CGCGCCCAAG CGGCGCACGA TGCAGATCAT CGCCGAGCTC
GAGACGTCGC CACGTGGGTT GTACACGGGC GCGATCGGTT GGCTCGATGC GACATCGCAT
AAGACTACGG AGGTCATAGG CGTCGGCGCC TTCGGCATGT CTGTGGCAAT CCGCACGCTG
GTGCTGGCGC CGCCCGGGGC CGATGGCCTG CGCGCGGCCG AGATGGGCGT GGGCGGTGGC
ATCGTGCACG ACAGCGTGGC CGACGATGAG TACGCCGAAT GCGGATGGAA AGCGCGGTTC
CTGATCGGTC ATGACCCCGG CTTCACGTTG TTCGAGACCA TGCATGCGCG CGATGGCGCA
GTGCTACATC GCGAGCGCCA TTTGCAGCGA CTAGCGAACT CGTCGGCGGC GTTCGGCTTC
GCGCTGGACT TGCCTGAAGC CCGCGCGGCG GTGCAGGCTG AAGCCGCGCG ACTGGGCGAT
GGCGACTGGC GCCTGCGCGT GAGTGTGGAC AAGCGCGGCA CGCTGGTATT TGCCAGCGGG
GCATTGGCTC CGCTCGCGCA TGGCGTCGTG GGGCTCGAGA TCGCACCGAC ATGCCTGCCC
GTTCACGATC CGCTGCGCCG GCACAAGCTC AGCGCGCGCG CCGTGTTCGA TCAGGGATGG
CAGGCCGCGG AGCGGGCAGG CGGGTTCGAT AGCCTGTTCT TCAACACGTG CGACGAACTG
CTCGAAGGCG GCCGCAGTTC GGTGTTCATC AAGGTAGATG GTCATTGGAT GACACCGCCA
CTCTCGGCGG ACATCCTGCC GGGCGTGATG CGTGCCGTGG TGCTCGAAGA AGGGGACGCA
TTGATCGACG GACCGGTGCA GGAAGCCACC GTGACGCGTG ACATGGTGCA ACGCGCCGAA
GCCATCGTTA TCGCCAACGC ATTGCGCGGC ACGCTGCGCG CGCGCCTGAT CTGA
 
Protein sequence
MPNAAIMPTA AFSHPVFVVP IPSPPFVLLD DATAGSGVTA SRLYTDFVRE DVLAAGADVS 
ALDTLLASGW RDGLHATFFA PYEFGGAIVG APVHVGDALP FHDGALRVLW FRTLRRLDTE
GVAQWLAAGA QPGPTGALDV TASTTRTQYT EAIARIHDYI EAGDTYQVNF TQRLRCRVFG
DPMAFYAALR AAQPVPFGVL AHLPGGGWVL SLSPELFVEH DGHGHLVARP MKGTAPRSGD
AEQDARAAKR LATDAKNRAE NVMIVDLLRN DLGRVAIPGS VAVPERFVVE PFGRVLQMTS
TVTATARPGT SFGALMAALF PCGSITGAPK RRTMQIIAEL ETSPRGLYTG AIGWLDATSH
KTTEVIGVGA FGMSVAIRTL VLAPPGADGL RAAEMGVGGG IVHDSVADDE YAECGWKARF
LIGHDPGFTL FETMHARDGA VLHRERHLQR LANSSAAFGF ALDLPEARAA VQAEAARLGD
GDWRLRVSVD KRGTLVFASG ALAPLAHGVV GLEIAPTCLP VHDPLRRHKL SARAVFDQGW
QAAERAGGFD SLFFNTCDEL LEGGRSSVFI KVDGHWMTPP LSADILPGVM RAVVLEEGDA
LIDGPVQEAT VTRDMVQRAE AIVIANALRG TLRARLI
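
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a reference implementation: it strips the layout whitespace, computes a GC fraction, and translates a CDS with the amino acid assignments of translation table 11 (which match those of the standard code). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence.

```python
BASES = "TCAG"
# Amino acid assignments for codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons of the gene sequence, followed by its TGA stop codon.
fragment = clean_sequence("ATGCCCAACG CAGCGATAAT GCCG TGA")
print(translate(fragment))  # MPNAAIMP
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared with the GC content field and the protein sequence shown in this record.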