Gene Information |
Locus tag | Gura_0607 |
Symbol | |
ID | 5163970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 728869 |
End bp | 730593 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640548107 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001229392 |
Protein GI | 148262686 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase; [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
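
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted as an amino acid.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Start bp and End bp copied from the record above.
gene_len, protein_len = cds_lengths(728869, 730593)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1725 bp) and Protein Length (574 aa) fields. The strand does not affect the length calculation.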
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.120756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC GGGTGCTTTT TAACTCGTTC TCCTTTGTCG GCCTCCAGGC TGAGATCACC GCATCGTCCC CGGCCGAAGT GCTCCCCGCG CTGCACCGGG TGGAAGAGGC GGTCGCTCGG GGGCTCCATG CGGCCGGCTT TGTTTCCTAC GAGGCTGCGG CCGGGCTCGA TACGGGTCTT GCCACCCGCC CGGCGGCCGG TTTCCCGCTT CTCTGGTTCG GCCTCTTCAG TGAACGCTCT GCGCTTGACC CATCTGCTGA GGTTCCCCCC GCGGAAGCCT ATCAAACATC CGGCTGGACG ACCTCGCTCG ACGGCGCGGA GCACGCCCGT GCCGTCGCCT CGATCAGGGA GCTCATCGCA TGCGGCCACA CCTACCAGGT GAACTTCACC ATGCGCCAAC ACTTCCGGTT CACCGGCGAC CCGGCCGCCT TCTACTCAGA CCTCCGCCGT TGCCAACCCA CACCTTATTC CGCATATATC GACACGGGAC GTTTCCAGAT CCTCTCCGCC TCACCGGAGC TCTTTTTCAG CCTGCGGGAA GGGGTCTTGA CCACCCGCCC CATGAAGGGG ACTGCCGGTC GCGGTCGGTG GTGGCAGGAG GATGAAGAGG CGAAGCAAAA GCTCAGGGAA AGCCCCAAGG AGCGTGCGGA GAACCTGATG ATCGTCGACC TGTTGCGAAA CGACATGGGG ATGGTTTCCA CGACCGGCTC CGTGCGGGTG AACTCACTGT TCGACGTGGA AACGCTCGGG ACCGTGCACC AGATGACCTC AACCATACAA TCGAGGCTCA AGGACGGGGT CGGCCTGACC GGGCTTTTCC AGGCCCTCTT CCCCTGCGGT TCCGTCACCG GCGCGCCGAA AAAGAGGACC ATGGAGATCA TCGCCGAGCT GGAGGATTCC CCCCGCGGCG TCTATACCGG CTGCATCGGT TACGTCTCCC CCGGTGACGA AGCACAGTTC AGCGTGGCGA TCAGGACCAT CGTCATCGAT GCCGCCACCG GCCGGGGGGA GCTGGGGATC GGCAGCGGCG TCACCTTCGA TTCGCAGGCG GACGCCGAAT ACGCGGAATG CCTCGCCAAG GGGCGATTCG CCCGTGAGAA GCCGCGGGAG TTCCACCTCA TCGAATCGCT CCTCTTCGAT GAGGGGAAGG GATATTTCCT CCTGGAGCGC CACATGGAAA GGCTAAAGCG TTCGGCGGCC TACTTCGCCT TCACCCTGGA AGCGGGTGGG GCGGAAAAGG CACTGGAGGA AATCGGCTCC CATCTCATCG GCAAACAAAA GGTGCGGCTG CTCCTTTCCC GGACCGGGGA AATTGCCTGT GAAGCCGCTC CCATCGGGGC GGAGTCAGGA GGGAAAGAAT TGACCGTCAC CTTCGCGAGA CGGACGGCAG ATTCAGCCGA TCCATTTCTC TACCACAAGA CCACCAGCCG GAGCCTCTAC GCCGAAGAAG CGGCGCTGCG TCCCGACTGC GCTGATGTCC TCTTCATAAA CGAGCGGGGC GAGGTCACGG AAGGGACCAT CAGCAACATC GTGGCCCGCA TTGGCGGAGA GCTCGTCACT CCGCCCCTCT GCTGCGGTCT GCTGCCGGGG GTGTTCCGGG AGGAACTGCT GGAGAATGGG GAAATCAGGG AGCGGGTGAT CTGCCGGGAG GAACTGGAAA CAGCCGAAGA AATTTTCCTG GTCAATTCGG TACGAAAGTG GCGGCGGGTA CAGCTTTGCG ATTAA
|
Protein sequence | MKKRVLFNSF SFVGLQAEIT ASSPAEVLPA LHRVEEAVAR GLHAAGFVSY EAAAGLDTGL ATRPAAGFPL LWFGLFSERS ALDPSAEVPP AEAYQTSGWT TSLDGAEHAR AVASIRELIA CGHTYQVNFT MRQHFRFTGD PAAFYSDLRR CQPTPYSAYI DTGRFQILSA SPELFFSLRE GVLTTRPMKG TAGRGRWWQE DEEAKQKLRE SPKERAENLM IVDLLRNDMG MVSTTGSVRV NSLFDVETLG TVHQMTSTIQ SRLKDGVGLT GLFQALFPCG SVTGAPKKRT MEIIAELEDS PRGVYTGCIG YVSPGDEAQF SVAIRTIVID AATGRGELGI GSGVTFDSQA DAEYAECLAK GRFAREKPRE FHLIESLLFD EGKGYFLLER HMERLKRSAA YFAFTLEAGG AEKALEEIGS HLIGKQKVRL LLSRTGEIAC EAAPIGAESG GKELTVTFAR RTADSADPFL YHKTTSRSLY AEEAALRPDC ADVLFINERG EVTEGTISNI VARIGGELVT PPLCCGLLPG VFREELLENG EIRERVICRE ELETAEEIFL VNSVRKWRRV QLCD
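
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration, not the database's own method: it assumes the gene sequence is listed 5' to 3' on the coding strand (so no reverse complement is needed despite the minus strand), and it builds the codon map from the usual compact amino-acid string in TCAG order, which translation table 11 shares with the standard code for internal codons. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in TCAG order (first base varies slowest).
# Table 11 uses the same assignments as the standard code for
# internal codons; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored; any trailing partial codon is dropped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
n_terminus = translate("ATGAAAAAGC GGGTGCTTTT TAACTCGTTC")
c_terminus = translate("CAGCTTTGCG ATTAA")
print(n_terminus, c_terminus)
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to translate should reproduce the whole protein under the same assumptions.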