Gene Bpro_4457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4457 
Symbol 
ID4012849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4709826 
End bp4711406 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content64% 
IMG OID637944110 
Productanthranilate synthase component I 
Protein accessionYP_551242 
Protein GI91790290 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.989225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTACTG AACTCGAATT TAAAAGCCTG GCCAGCCAGG GCTACAACCG CATTCCGCTG 
ATGCTGGAGG CGTTTGCCGA TCTCGAAACC CCGCTGTCGC TCTACCTCAA GCTTGCCAAC
GCCAAAGACG GCGGCAAGTT CAGCTTTTTG CTGGAGTCGG TGGTGGGTGG CGAGCGTTTT
GGCCGCTACA GCTTTATTGG CTTGCCCGCG CGTACGCTGA TACGCTGCTC GGGCTTCGGG
GCCGACATGC TCACCGAGGT GGTGAAGGAC GGCCGGGTCA TCGAAACCTC GCGAGTCAAT
CCGCTGGATT TCATCAGTGA CTACCAAAAA CGGTTCAAGG TCGCGCTCAG GCCCGGCCTG
CCGCGCTTTT GCGGCGGACT GGCCGGTTAC TTTGGCTATG ACACGGTGCG CTACATCGAG
AAAAAACTCG AAGCCTCCTG CCCGCCGGAC ACGCTGGGCT GCCCCGACAT CATGCTGCTG
CAGTGCGAAG AACTGGCCGT GATCGACAAC CTCTCGGGCA AGCTCTACCT GATCGTGTAC
GCGGATCCGG CCCAGCCCGA GGCCTTTGCC AACGCCAAGA AGCGCCTGCG TGAGCTCAAG
GAGCAGCTCA AATACTCCGT CAGCGCGCCG GTCGTCAAGC CCTCGCAGGG CTATCCGGCC
GAGCGTGACT TTGCCAAGGC CGACTACATT GCCGCGGTAG AGCGCGCCAA GCGGCTGATC
GAGGCCGGCG ATTTCATGCA GGTGCAGGTG GGCCAGCGCA TCAAGAAGCG CTACACCGAG
TCGCCGCTGA GCCTGTACCG GGCGCTGCGC GCGCTCAACC CCTCGCCCTA CATGTACTAC
TACCACTTTG GCGACTTCCA TGTGGTGGGG GCGTCGCCTG AGATTCTGGT GCGGCAGGAG
CAGGTGGCTC GCGTCGCCGG GCCGCCCCAA GACGCGAACG CCCCCTCGGG GGGCAGCGAG
TACACGCCAG TGACGAGCGT GGGGGCCCAG TTCGAGCAGA AGATCACGAT CCGGCCCCTG
GCCGGCACCC GCCCGCGCGC TTCGTCCATC GAGGCTGACA AGGCGGTCGA GCAGGAGCTG
GTCAATGACC CGAAGGAGCG CGCCGAGCAC GTGATGCTGA TCGACCTGGC GCGCAACGAC
ATCGGCCGCA TCGCCAAAAC CGGCACCGTC AAGGTGACCG AAGCCTTTGC GGTGGAGCGC
TACAGCCATG TGATGCACAT CGTGAGCAAC GTCGAAGGCG TCTTGCTCGA TGGCATGACC
AGCATGGATG TGCTCAAGGC GACCTTCCCG GCCGGCACGC TGACCGGCGC GCCCAAGGTG
CATGCGATGG AACTGATCGA CCAGCTGGAG CCCACCAAGC GCGGCCTGTA TGGCGGCGCC
TGCGGTTACC TCAGTTATGC GGGCGACATG GACGTGGCCA TTGCGATCCG CACCGGCATC
ATCAAGGACC AGACCCTGTA TGTGCAGGCG GCGGCCGGCG TGGTGGCTGA CTCGGTGCCC
GAGCTGGAAT GGAAAGAAAC CGAAGCCAAG GCGCGCGCCT TGCTGCGCGC CAGCGAACTG
GTCGAGGAGG GCCTGGAATG A
 
Protein sequence
MITELEFKSL ASQGYNRIPL MLEAFADLET PLSLYLKLAN AKDGGKFSFL LESVVGGERF 
GRYSFIGLPA RTLIRCSGFG ADMLTEVVKD GRVIETSRVN PLDFISDYQK RFKVALRPGL
PRFCGGLAGY FGYDTVRYIE KKLEASCPPD TLGCPDIMLL QCEELAVIDN LSGKLYLIVY
ADPAQPEAFA NAKKRLRELK EQLKYSVSAP VVKPSQGYPA ERDFAKADYI AAVERAKRLI
EAGDFMQVQV GQRIKKRYTE SPLSLYRALR ALNPSPYMYY YHFGDFHVVG ASPEILVRQE
QVARVAGPPQ DANAPSGGSE YTPVTSVGAQ FEQKITIRPL AGTRPRASSI EADKAVEQEL
VNDPKERAEH VMLIDLARND IGRIAKTGTV KVTEAFAVER YSHVMHIVSN VEGVLLDGMT
SMDVLKATFP AGTLTGAPKV HAMELIDQLE PTKRGLYGGA CGYLSYAGDM DVAIAIRTGI
IKDQTLYVQA AAGVVADSVP ELEWKETEAK ARALLRASEL VEEGLE