Gene Franean1_1928 details


Gene Information

Locus tag: Franean1_1928
Symbol:
ID: 5670329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2310397
End bp: 2311974
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 72%
IMG OID: 641240849
Product: anthranilate synthase component I
Protein accession: YP_001506271
Protein GI: 158313763
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
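
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2310397
end_bp = 2311974

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced: No")
# and ends in one stop codon that does not code for an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1578 bp) and Protein Length (525 aa) fields.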


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.929386
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCG GCGAGATCAC ACCGAGCCGG GCGGAGTTCC ACGAGCTCGC CGCGCGCCAG 
CCGGTCGTCG CGGTGTCCCG CCGCCTGCTC GCCGACGGCG AGACACCGGT CGGGGTCTAC
CGCAAGCTGG CCGGCGGGCC GGGGACGTTT CTGCTCGAGT CTGCCGAGCA CGGGGGCGTG
TGGTCGCGTT ACTCCTTCGT CGGCGTCCGT GCCGCGGCCA CACTCACTGA ACGGGACGGG
CAGGCCGCCT GGACGGACGG AACCCCGCCG CCCGGTGTCC CGCTCGACGG CGACCCGCTC
GATGTCCTGC GTGCCGTGGA ACGCCAGCTC TGCTCGGCCC GGCCGAGTGG CACTCCGCCG
CTGCTGGGCG GCCTGGTCGG GTACCTCTCG TACGACATCG TCCGGCGCAT CGAGCGGTTG
CCCGCCCGGG CAACCGACGA TCTCGGGATG CCCGAGCTGC GGATGCTCCT GACGACCGAC
CTGGCCGTCC TCGACCACAC CGACGGATCG TGCCAGCTCG TCGCGAACAT CTTCACCGGC
GCCGGGGACC CGGCCGACCC GGCCGAAGCA GCGGACCTGG CTGGCCCAGG AGCGGCGGGC
GGACCGTCCG CGCGGCGGGC GGAGCTGGAC GCCGCCTACG ACGATGCGGT GCACCGCATC
GAGGTGATGA CGGCAGATCT CGGCAAGTGG AGTGAGCCGA CCGTGGCGAC CACAACCGGG
GCGTCCACGG GCGTGCGCGA CTTCGCCTCC GCAACCCCGC CCGGCGGCTT CCACGCTGCC
GTCGAGCGGT CGATCGAGGA GATCCGGGCG GGGGAGTGCT TCCAGATCGT GGTCTCCCAG
CGGTTCGAGC GCCCCACCAC CGCTGACGCC CTCGACGTCT ACCGGGTCCT GCGGGCGTCG
AACCCCAGCC CCTACATGTA CCTGCTGCGG TTCGCCGATC ATGATGTGGT CGGCTCGTCG
CCGGAGGCGC ACGTCAAGGT CACCGGCCGC CGGGCGTTGC TGCACCCGAT CGCGGGCAGC
CGGCCGAGGG GCGAGACCCC CGAGCGCGAT GCCGAACTGG CTGCCCAGCT CCTGGCCGAT
CCGAAGGAAC GGTCCGAGCA CGTGATGCTG GTCGACCTGG TCCGCAATGA TCTCGGGCGG
GTCTGCGTGC CCGGATCGGT GCGGGTGGTC GAGTTCGCGT CCGTCGAGCG GTTCTCGCAC
ATCATGCACA TCGTCTCCAC CGTGATCGGT GAGGTGGCGC CCGAGCGCAG CGCGGTCGAC
GTCCTCGCCG CGACCTTTCC CGCCGGGACG TTGTCGGGAG CGCCCAAGGT GCGGGCCATG
GAGATCATCG ACGAGCTCGA GCCGACGAGG CGCGGCCTGT ACGGCGGGGT CGTGGGATAT
CTCGATTTCG GCGGTGACCT CGACACCGCG ATCGCCATCC GCACAGCGGT CCTCCGTTCA
GGAATGGCCT ACGTGCAGGC CGGCGCCGGG ATCGTGGCGG ACTCCGTTCC CGACACCGAG
GATCTCGAGA GCCGGACGAA GGCCGCGGCG GTTCTCCGCG CGATCGAGGT GGCGGAGTCG
CTCCGCCCGC CGGTATGA
 
Protein sequence
MTTGEITPSR AEFHELAARQ PVVAVSRRLL ADGETPVGVY RKLAGGPGTF LLESAEHGGV 
WSRYSFVGVR AAATLTERDG QAAWTDGTPP PGVPLDGDPL DVLRAVERQL CSARPSGTPP
LLGGLVGYLS YDIVRRIERL PARATDDLGM PELRMLLTTD LAVLDHTDGS CQLVANIFTG
AGDPADPAEA ADLAGPGAAG GPSARRAELD AAYDDAVHRI EVMTADLGKW SEPTVATTTG
ASTGVRDFAS ATPPGGFHAA VERSIEEIRA GECFQIVVSQ RFERPTTADA LDVYRVLRAS
NPSPYMYLLR FADHDVVGSS PEAHVKVTGR RALLHPIAGS RPRGETPERD AELAAQLLAD
PKERSEHVML VDLVRNDLGR VCVPGSVRVV EFASVERFSH IMHIVSTVIG EVAPERSAVD
VLAATFPAGT LSGAPKVRAM EIIDELEPTR RGLYGGVVGY LDFGGDLDTA IAIRTAVLRS
GMAYVQAGAG IVADSVPDTE DLESRTKAAA VLRAIEVAES LRPPV
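
Both sequences are printed in blocks of 10 characters separated by spaces and line breaks, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last rows of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any whitespace,
    # which removes both the spaces and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACCACCG GCGAGATCAC ACCGAGCCGG GCGGAGTTCC ACGAGCTCGC CGCGCGCCAG"
last_row = "CTCCGCCCGC CGGTATGA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

# Sanity checks on the CDS boundaries: an ATG start codon and one of the
# three standard stop codons at the end.
starts_with_atg = first.startswith("ATG")
ends_with_stop = last[-3:] in {"TAA", "TAG", "TGA"}

print(len(first), starts_with_atg, ends_with_stop)
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field reported above.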