Gene Information |
Locus tag | Bpro_4457 |
Symbol | |
ID | 4012849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | - |
Start bp | 4709826 |
End bp | 4711406 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637944110 |
Product | anthranilate synthase component I |
Protein accession | YP_551242 |
Protein GI | 91790290 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
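The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the values listed) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(gene_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


# Values taken from the record for Bpro_4457.
length = gene_length(4709826, 4711406)
print(length, protein_length(length))  # 1581 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields.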
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.989225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTACTG AACTCGAATT TAAAAGCCTG GCCAGCCAGG GCTACAACCG CATTCCGCTG ATGCTGGAGG CGTTTGCCGA TCTCGAAACC CCGCTGTCGC TCTACCTCAA GCTTGCCAAC GCCAAAGACG GCGGCAAGTT CAGCTTTTTG CTGGAGTCGG TGGTGGGTGG CGAGCGTTTT GGCCGCTACA GCTTTATTGG CTTGCCCGCG CGTACGCTGA TACGCTGCTC GGGCTTCGGG GCCGACATGC TCACCGAGGT GGTGAAGGAC GGCCGGGTCA TCGAAACCTC GCGAGTCAAT CCGCTGGATT TCATCAGTGA CTACCAAAAA CGGTTCAAGG TCGCGCTCAG GCCCGGCCTG CCGCGCTTTT GCGGCGGACT GGCCGGTTAC TTTGGCTATG ACACGGTGCG CTACATCGAG AAAAAACTCG AAGCCTCCTG CCCGCCGGAC ACGCTGGGCT GCCCCGACAT CATGCTGCTG CAGTGCGAAG AACTGGCCGT GATCGACAAC CTCTCGGGCA AGCTCTACCT GATCGTGTAC GCGGATCCGG CCCAGCCCGA GGCCTTTGCC AACGCCAAGA AGCGCCTGCG TGAGCTCAAG GAGCAGCTCA AATACTCCGT CAGCGCGCCG GTCGTCAAGC CCTCGCAGGG CTATCCGGCC GAGCGTGACT TTGCCAAGGC CGACTACATT GCCGCGGTAG AGCGCGCCAA GCGGCTGATC GAGGCCGGCG ATTTCATGCA GGTGCAGGTG GGCCAGCGCA TCAAGAAGCG CTACACCGAG TCGCCGCTGA GCCTGTACCG GGCGCTGCGC GCGCTCAACC CCTCGCCCTA CATGTACTAC TACCACTTTG GCGACTTCCA TGTGGTGGGG GCGTCGCCTG AGATTCTGGT GCGGCAGGAG CAGGTGGCTC GCGTCGCCGG GCCGCCCCAA GACGCGAACG CCCCCTCGGG GGGCAGCGAG TACACGCCAG TGACGAGCGT GGGGGCCCAG TTCGAGCAGA AGATCACGAT CCGGCCCCTG GCCGGCACCC GCCCGCGCGC TTCGTCCATC GAGGCTGACA AGGCGGTCGA GCAGGAGCTG GTCAATGACC CGAAGGAGCG CGCCGAGCAC GTGATGCTGA TCGACCTGGC GCGCAACGAC ATCGGCCGCA TCGCCAAAAC CGGCACCGTC AAGGTGACCG AAGCCTTTGC GGTGGAGCGC TACAGCCATG TGATGCACAT CGTGAGCAAC GTCGAAGGCG TCTTGCTCGA TGGCATGACC AGCATGGATG TGCTCAAGGC GACCTTCCCG GCCGGCACGC TGACCGGCGC GCCCAAGGTG CATGCGATGG AACTGATCGA CCAGCTGGAG CCCACCAAGC GCGGCCTGTA TGGCGGCGCC TGCGGTTACC TCAGTTATGC GGGCGACATG GACGTGGCCA TTGCGATCCG CACCGGCATC ATCAAGGACC AGACCCTGTA TGTGCAGGCG GCGGCCGGCG TGGTGGCTGA CTCGGTGCCC GAGCTGGAAT GGAAAGAAAC CGAAGCCAAG GCGCGCGCCT TGCTGCGCGC CAGCGAACTG GTCGAGGAGG GCCTGGAATG A
|
Protein sequence | MITELEFKSL ASQGYNRIPL MLEAFADLET PLSLYLKLAN AKDGGKFSFL LESVVGGERF GRYSFIGLPA RTLIRCSGFG ADMLTEVVKD GRVIETSRVN PLDFISDYQK RFKVALRPGL PRFCGGLAGY FGYDTVRYIE KKLEASCPPD TLGCPDIMLL QCEELAVIDN LSGKLYLIVY ADPAQPEAFA NAKKRLRELK EQLKYSVSAP VVKPSQGYPA ERDFAKADYI AAVERAKRLI EAGDFMQVQV GQRIKKRYTE SPLSLYRALR ALNPSPYMYY YHFGDFHVVG ASPEILVRQE QVARVAGPPQ DANAPSGGSE YTPVTSVGAQ FEQKITIRPL AGTRPRASSI EADKAVEQEL VNDPKERAEH VMLIDLARND IGRIAKTGTV KVTEAFAVER YSHVMHIVSN VEGVLLDGMT SMDVLKATFP AGTLTGAPKV HAMELIDQLE PTKRGLYGGA CGYLSYAGDM DVAIAIRTGI IKDQTLYVQA AAGVVADSVP ELEWKETEAK ARALLRASEL VEEGLE
|
| |
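The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes NCBI translation table 11 (as listed in the record), in which the codon-to-amino-acid assignments match the standard code and alternative start codons such as GTG are read as methionine at the initiator position. Only the first 30 and last 18 nucleotides of the gene sequence are used here to keep the example short; the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares them and
# differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    If `initiator` is true, a valid start codon in first position yields M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence: GTG start codon is read as M.
print(translate("GTGATTACTG AACTCGAATT TAAAAGCCTG"))  # MITELEFKSL
# Last 18 nt (in frame), ending in the TGA stop codon.
print(translate("GAGGAGG GCCTGGAATG A", initiator=False))  # EEGLE
```

The two fragments reproduce the first ten and last five residues of the protein sequence, which also shows that the gene sequence is displayed in coding orientation even though the gene lies on the minus strand.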