Gene Acry_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1234 
Symbol 
ID5162294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1373400 
End bp1374914 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content72% 
IMG OID640553148 
Productanthranilate synthase component I 
Protein accessionYP_001234364 
Protein GI148260237 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACCCG AGACCGACCA GCACGCCCGT TTCCGCGCCG CCTACGATGC CGGGCGGGGC 
GCGCTGGCCT GGCGACTGCT CGCCGCCGAC CTGATCACGC CGGTCGCCGC CTTCCTCAAG
CTCGCGCACG GCACGGCGCA TTCCTTCCTG CTCGAGAGCG TCGAGGGCGG GGCGGCGCGC
GGGCGCTATT CGGTGATCGG CTTCGCGCCG GACCTGATCT GGCGGTGCGA CCGCGGCCAG
GTGAGCGTGA ACCGCGACCC GGCGGCGGAC GCGGAAGGCT ACGCGGCCGA TCCGCGCCCG
GCCTTCGAGA GCCTGCGCGA CCTGATCGCG GCGACGCGGC TCGACCTGCC GGACCACCTG
CCGCCGATGA CCGGCGGGCT GGTCGGCTAT CTCGGCTATG ACATGGTGCG GCTGATGGAG
GAGCTGCCGG CGGTGAAGCC GGACGTGCTC GGGATTCCCG ACGCCGTGCT GATGCGGCCG
GGGGTGTTCG CGATTTTCGA TTCGGTGCGC GACGAGATCA CGCTGGCCGC ACCGGTCTGG
CCGCGCGCGG GGGTTTCGGC GGCGGAGGCC TGGAGCCTCG CGACCGGGCG GATCGAGGCG
GCGATGGCGG CGCTCGACCG GCCGACGCCG GCGCTGCCGG GCCATGCCGC GCCGGCCGGG
CCGGCGACCT CGAACATGAG CGAGGACGAG TTCCGCGCGA TGGTGCTGCG GGGCAAGGAC
TACATCGCTG CCGGCGACGT GTTCCAGGTG GTGGCGAGCC AGCGCTTCGA GCAGGAATTC
ACGCTCCCGC CCTTCGCGCT GTATCGCTCG CTGCGGCGGA TCAACCCGGC GCCGTTCCTG
TTCTTCCTCG ATTTCGGCGC CTATCAGGCG GTCGGCTCCA GCCCGGAGAT CCTGGTGCGG
CTGCGCGACG GCACGGTGAC GATCCGCCCG CTGGCCGGCA CGCGCCGGCG CGGGGCGACG
CGCGCCGAGG ACGAGGCGCT GGAGACCGAG CTGCTGGAGG ACCCGAAGGA GCGCGCCGAG
CACCTGATGC TGCTCGACCT CGCGCGCAAC GATGTCGGCC GCGTCGCCGA GATCGGCTCG
GTCAACGTGA CCGAGAGCTT CGTGATCGAG CGGTTCTCGC ACGTGATGCA CATCTCCTCG
AATGTCGAGG GCCGCATCCG GCCGGAGGCG GACGCGCTGG ACGCGCTGAT CGCCGGCTTC
CCGGCGGGCA CGCTCTCGGG CGCGCCGAAG GTGCGGGCGA TGGAGATCAT CGAGGAGTTC
GAGCCCAGCC GGCGCGGGCT CTATGGCGGG TGCATCGGCT ATTTCGCCGC CAACGGCACG
ATGGATACCT GCATCGCGCT GCGCACCGCG CTGATCAGGG ACGGGCGGCT CTATGTCCAG
GCCGGGGTCG GCATCGTCGC CGATTCCGAC CCCGATGCCG AGCTCGCCGA GAGCCACGCC
AAGGCGCGCG CGCTGTTCCG CGCCGCCGAG GATGCCGGCC AGTACGTCAC CCCCAACGAG
GCGCCGGCCC GATGA
 
Protein sequence
MTPETDQHAR FRAAYDAGRG ALAWRLLAAD LITPVAAFLK LAHGTAHSFL LESVEGGAAR 
GRYSVIGFAP DLIWRCDRGQ VSVNRDPAAD AEGYAADPRP AFESLRDLIA ATRLDLPDHL
PPMTGGLVGY LGYDMVRLME ELPAVKPDVL GIPDAVLMRP GVFAIFDSVR DEITLAAPVW
PRAGVSAAEA WSLATGRIEA AMAALDRPTP ALPGHAAPAG PATSNMSEDE FRAMVLRGKD
YIAAGDVFQV VASQRFEQEF TLPPFALYRS LRRINPAPFL FFLDFGAYQA VGSSPEILVR
LRDGTVTIRP LAGTRRRGAT RAEDEALETE LLEDPKERAE HLMLLDLARN DVGRVAEIGS
VNVTESFVIE RFSHVMHISS NVEGRIRPEA DALDALIAGF PAGTLSGAPK VRAMEIIEEF
EPSRRGLYGG CIGYFAANGT MDTCIALRTA LIRDGRLYVQ AGVGIVADSD PDAELAESHA
KARALFRAAE DAGQYVTPNE APAR