Gene GM21_0332 details

Gene Information

Locus tag: GM21_0332
Symbol:
ID: 8135639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 412273
End bp: 413994
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 67%
IMG OID: 644867949
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_003020171
Protein GI: 253698982
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
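
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 412273, 413994
length_bp = gene_length_bp(start_bp, end_bp)
print(length_bp, protein_length_aa(length_bp))  # prints "1722 573"
```

Under these assumptions the result agrees with the Gene Length (1722 bp) and Protein Length (573 aa) fields.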


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 1.42564e-18
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGACGGCCG TTTCCCCCGC ATACCTGCAT GGTTTCAGCT TCCGCGCGCC GGCGGGCGAG 
ATCTGCGCCG TCACGCCCGG GCAGGTCGTC CCCGCGCTAA GATCCCTGGA ACGGCAGGTG
GCCTCGGGGC TGCACGCGGC GGGGTTTGTG AGCTACGAGG CGGCCGGCGC CTTGAACGGC
GACCTCACCA CTTGCGCGCC CGGAAAGCTG CCGCTCCTTT GGTTCGGCCT CTACCAAAGC
CGCTCCCGGG CGCCCTTTCC GCCCCACGCC AGCCATTTTT ACTGCGGCGA CTGGCGCCCC
TCGCTCGACG CGGCGGCGTT CGACCGGGGG GTGGCGGCGA TCCGCGAGCT GATCGCCGCC
GGCGACTGCT ACCAGGTAAA TTTCACCCTG CGCCAGCGCT TCTCGTTTGC AGGGTGCCCC
CGCTCCTTCT TCTCGGAGTT GAGCCGCAGC CAGCCGACCC CTTACGGCTG CTACCTGGAG
ACGGGGGACT TCCGCATCCT CTCCGCGTCC CCCGAGCTTT TCTTCTCCCT GTCCGAAGAC
GTCCTCACCA CCCGTCCCAT GAAGGGGACG GCGCCCCGGG GGAGGTGGCC GGACGAGGAC
CGGGCCCGGA GAAGGAGCCT GAAGGAAAGC CCCAAGGAGC TGGCGGAAAA CCTGATGATC
GTGGATCTCC TGCGAAACGA CATGGGGCTG GTGTCGCGAA CCGGTTCGGT GCGGGTCGCC
TCCCTCTTCG ACGTGGAGAG TTACCCAACC GTGCACCAGA TGACCTCCAC CATCGAGTCG
AGGCTCCGGG AGGGGGTGGG AACGCTGGAG CTCTTCCAGG CGCTTTTCCC CTGCGGCTCC
GTGACCGGCG CCCCCAAGAA AAGGAGCATG GAGATCATCG CGCAACTGGA GGGGGAGCCG
CGCGGCCTTT ACACCGGGTG CATCGGCTAC CTCTCCCCGG GGGGCGAGGC GAAATTCAGC
GTCGCCATAC GCACCGCCGT CCTCGACCTC AAGGCCGGCG AGGGGGAGAT CGGCATCGGC
AGCGGCATCA CCTACGATTC CGTCGCCGGG GACGAGTACC GCGAGTGTAT CTCCAAGGCG
CGCTTCGCCC GCGAACCTCT CCCCGAGTTC CAGCTGATCG AATCGCTTCT TTACGACGGC
GGCTATTTTC TGCTGGAGCG GCACCTGGAA AGGCTCGCCC GCTCCGCCGC CTACTTCTCC
TTTGCGCTGC AGCCGGAGGC GGCCAGCCGT GCCTTGGAAG AAACCGCTGC GGGGCTATCC
GCCGGGAAAA GCTACAAGGT GCGCCTGCTC TTAAGCCGGG ACGGGAGCCT CGCCTGCGAG
GCGGCGCCGA TAGAGCCCGT CGCGATTCAA ACCACCGCTG GCTTCGCGGC CGCCCGGGTC
GACTCGCAGG ACCGGTTCCT CTACCACAAG ACCACGCTGC GCGACCGCTG CCGGTACGAG
TTGGCGGCGC GGCCGGAACT GGACGAGGTG ATCTTCGAGA ACGAGCACGG CGAGGTCACC
GAAGGGGCCA ACAGCAACAT CGTGGCGCGC ATCGAGGGGC GCTATCTGAC CCCTCCGCTG
GCAAGCGGGC TCTTGCCCGG GACCTTTCGG GAGGAACTTC TGGCAGAGGG AACCATCGAG
GAGCGTGTGC TGACCCGTGC GGACCTGGAA GGTGCCGAGG CGCTCTTTCT GATAAACTCG
GTGCGCAAGT GGCGCCCGGC GGCACTGATG TCGATGAGAT AG
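
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before computing anything from it. The following is a minimal sketch of how the GC content field could be recomputed from the text above. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as sample input; the full sequence would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "TTGACGGCCG TTTCCCCCGC ATACCTGCAT GGTTTCAGCT TCCGCGCGCC GGCGGGCGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the whole sequence, `len(seq)` should match the Gene Length field, and the GC percentage can be compared with the rounded value reported in the record.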
 
Protein sequence
MTAVSPAYLH GFSFRAPAGE ICAVTPGQVV PALRSLERQV ASGLHAAGFV SYEAAGALNG 
DLTTCAPGKL PLLWFGLYQS RSRAPFPPHA SHFYCGDWRP SLDAAAFDRG VAAIRELIAA
GDCYQVNFTL RQRFSFAGCP RSFFSELSRS QPTPYGCYLE TGDFRILSAS PELFFSLSED
VLTTRPMKGT APRGRWPDED RARRRSLKES PKELAENLMI VDLLRNDMGL VSRTGSVRVA
SLFDVESYPT VHQMTSTIES RLREGVGTLE LFQALFPCGS VTGAPKKRSM EIIAQLEGEP
RGLYTGCIGY LSPGGEAKFS VAIRTAVLDL KAGEGEIGIG SGITYDSVAG DEYRECISKA
RFAREPLPEF QLIESLLYDG GYFLLERHLE RLARSAAYFS FALQPEAASR ALEETAAGLS
AGKSYKVRLL LSRDGSLACE AAPIEPVAIQ TTAGFAAARV DSQDRFLYHK TTLRDRCRYE
LAARPELDEV IFENEHGEVT EGANSNIVAR IEGRYLTPPL ASGLLPGTFR EELLAEGTIE
ERVLTRADLE GAEALFLINS VRKWRPAALM SMR