Gene Amir_5704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5704 
Symbol 
ID8329911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6734079 
End bp6735620 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content73% 
IMG OID644946142 
Productanthranilate synthase component I 
Protein accessionYP_003103365 
Protein GI256379705 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGCG TCATCGGTGC GGGCGTCGGG CTCGGCGAGG TCAGTCCCAC CCGCGAGGAG 
TTCCGCGAAC TCGCCGCGCA GCGCAGGGTC GTCCCTGTCG TGCGCCGGAT CCTCGCGGAC
GCGGAGACCC CCGTCGGGCT GTACCGGAAG CTGGCCGCCG ACCGGCCGGG CACGTTCCTG
TTCGAGTCGG CGGAGAACGG CCGCTCGTGG GCGCGCTGGT CGTTCGTGGG GGCCCGGTGC
GCGGGCGCGC TCACCGCGAC CGGCGGCGAG GCGCACTGGA CCGGGCAGCA CCCGGTCGGG
TTGCCCGAGG GCGGCGACCC GCTGGCGGCG CTGCGCGAGA CCATCGAGGT GCTGCGCACC
GACCCGCTGC CCGGCCTGCC CCCGCTGACC GGCGGCATGG TCGGCTACAT CGGGTACGAC
GCGGTGCGCC GCCTGGAGCG GCTGCCCTCG CTGGCCGAGG ACGACTTGAA GGTGCCCGAG
CTGGTCATGC TGCTCGCCAC CGACCTCGCC GCGCTCGACC ACCACGAGGG CACCGTCACG
CTGATCGCCA ACGCGATCAA CTGGGACGAC ACCCCGGAGC GGGTCGACGC GGCCTACGAC
GACGCGGTGC GCAGGCTCGA CGTGATGACC GAGGACCTGC ACAACCCGGC GCCCGCCACG
ACCGCCGTGT TCTCCCGGCC CAAGCCGGAG TTCACCCGGC GGCGCTCCTC CGCCGAGCAC
CAGGCCGTGG TGGAGAAGGC GAAGGCGGCG ATCCGGGAGG GCGAGGCCTT CCAGGTGGTG
CTGTCGCAGC GGTTCGAGAT GGAGACCACC GCGCACCCGC TGGACGTCTA CCGGGTGCTG
CGCACCTCCA ACCCCAGCCC GTACATGTAC CTGCTGCGGC TGGACGACTT CGACATCGTC
GGGTGCAGCC CCGAGTCGCT GGTCACGGTC CGGGACGGCA AGGCCACCAC TCACCCGATC
GCGGGCACCC GGTGGCGCGG CGCGGACCCG GAGGAGGACG CGCTGCTGGA GAAGGACCTG
CTCTCCGACG ACAAGGAGCG CGCCGAGCAC CTCATGCTCG TGGACCTGGG CCGCAACGAC
CTGGGCCGGG TGTGCAAGCC GGGCTCGGTG ACCGTGGTGG ACTTCTTCAA GGTCGAGCGG
TACAGCCACG TGATGCACAT CGTGTCCACG GTCAGCGGCG AGCTGGCCGA GGGGCGCACC
GCGTTCGACG CGGTCGCGGC GTGCTTCCCG GCGGGCACCC TGTCCGGCGC GCCGAAGCCG
CGCGCGATGG AGCTGATCGA GGAGCTGGAG CCGACCCGGC GCGGCCTGTA CGGCGGCGTC
GTGGGCTACC TGGACTTCGC GGGCGACGCG GACACCGCGA TCGCGATCCG CACCGCGCTG
ATCCGCGACG GGGTCGCGTA CGTGCAGGCG GGCGGCGGCA TCGTGGCCGA CTCGGACCCG
GTGGCCGAGG ACAACGAGTG CCTGAACAAG GCGGGCGCGG TCCTGGCCGC GATCGCGACG
GCCCAGACCA TGGCCCCGGC CGTGGAACCG ACCCGTGTCT GA
 
Protein sequence
MVSVIGAGVG LGEVSPTREE FRELAAQRRV VPVVRRILAD AETPVGLYRK LAADRPGTFL 
FESAENGRSW ARWSFVGARC AGALTATGGE AHWTGQHPVG LPEGGDPLAA LRETIEVLRT
DPLPGLPPLT GGMVGYIGYD AVRRLERLPS LAEDDLKVPE LVMLLATDLA ALDHHEGTVT
LIANAINWDD TPERVDAAYD DAVRRLDVMT EDLHNPAPAT TAVFSRPKPE FTRRRSSAEH
QAVVEKAKAA IREGEAFQVV LSQRFEMETT AHPLDVYRVL RTSNPSPYMY LLRLDDFDIV
GCSPESLVTV RDGKATTHPI AGTRWRGADP EEDALLEKDL LSDDKERAEH LMLVDLGRND
LGRVCKPGSV TVVDFFKVER YSHVMHIVST VSGELAEGRT AFDAVAACFP AGTLSGAPKP
RAMELIEELE PTRRGLYGGV VGYLDFAGDA DTAIAIRTAL IRDGVAYVQA GGGIVADSDP
VAEDNECLNK AGAVLAAIAT AQTMAPAVEP TRV