Gene Franean1_1804 details

Gene Information

Locus tag: Franean1_1804
Symbol: trpD
ID: 5670206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2166921
End bp: 2168219
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 77%
IMG OID: 641240725
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001506148
Protein GI: 158313640
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
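
The coordinates and lengths in this record agree with one another: the inclusive span from the start to the end position is 1299 bp, which is 433 codons, and dropping the terminal stop codon leaves 432 amino acids. The following is a minimal sketch of that check in Python. The function name is hypothetical and is not part of any database API.

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Protein length implied by a CDS length that includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the untranslated stop codon.
    return gene_length_bp // 3 - 1


# Start bp and End bp as listed in this record (inclusive coordinates).
start_bp = 2166921
end_bp = 2168219
gene_length = end_bp - start_bp + 1

print(gene_length, expected_protein_length(gene_length))  # prints "1299 432"
```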


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.204599
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACAA CGACTGCAGA CGGACACGAC ATGACGAGCA CCGCCTCGGA GCCGCGGGCG 
ACGACGGCCG CGGCCTCCCC CCTCCCGACG GCCGCCTCTT CCCCGGCCGA GTCCCCCTCC
CCGGGCGTCA CCGCTGCCGG GGGGGCCCGG GCCACCGCCG AGAGCTGGCC GGATCTGATC
ACCGACCTGA TCGCCGGGCA GGCGCTGGCC GCCGACCGGA CGGCGTGGGC CATGGAGCAG
ATCATGGGCG GGCTGGCGAC GCCGTCCCAG ATCGCCGGCT TCGTGGTGGC GCTGCGGGCC
AAGGGTGAGA CCGCCCAGGA GATCGGCGGG CTGGTCCGCA CGATGCTCGG CTTCGCCGAG
CCGCTCACCC TCAGCGAGGA GCTGCGCGCC GCCGCAGTCG ACACCTGCGG AACCGGCGGC
GACCGCTCGA ACACCGTGAA CCTGTCGACC ATGGCCGCGA TCGTGGCGGC CGGCGCCGGG
GTCACCGTGG TCAAGCACGG TAACCGCGCG GCGTCGTCGG CGAGTGGGTC GGCCGACGTG
CTGGCCGAGC TCGGCGTTGT CATCGACCTG CCGCCGGCCG GGGTGGAAGC GTGCCTGGCC
GCCGCGGGGA TCGCCTTCTG CTTCGCCCCG GTCTTCCACC CGGCGATGCG GCACGTCGGC
GCCACCCGCA AGGAGCTGGG GGTGCAGACC GCGTTCAACA TCCTCGGCCC GCTGGCGAAC
CCGGCGCGGC CGGGCGCCCA GACGATCGGC GTGGCCGACG CGCGGCTGGC CCCGGTCGTC
GCCGACGTGC TCGCCGAGCG GGGAACCCGG GGCCTCGTCT TCCGCGGCGA CGACGGACTG
GACGAACTCA CCACGGCCAC CACGTCGACC GTGTGGGTCG TCCAGGCGCC CGACCCCACG
TCCAGCCGCA CGCCGGGTTC CACGGCCGGC TCCGTGGCCG GGAGCGCGTC GGAGGCCGCG
GCGGTGCGCC GCTCCCGGGT TCGTTCGGAG CACTTCGACC CTCGCGACCT CGGCCTCGCC
CGGCCGGACA CGACCGCGCT GCGGGGCGCG GACGCCGCCT ACAACGCTTC CGTGGCCCGG
GCCATGCTGC GCGGGGAGAC CGGGCCGGTC CGCGACGCGG TGCTGCTCGC CGCGGCCGCG
ACCCTGGTCG CGGTGGACGG CCCCACCGAC GCCCCGGTGG CCGAGCAGAT AGCGGCCCAG
CTCGGGCGCG CCACCGAGGC CGTCGACTCC GGTGCCGCCG CGGCCGCGCT GAGCCGCTGG
GCCGAGGCCA GCCAGCTCGC GGCGACGGCC CGGGGCTGA
 
Protein sequence
MRTTTADGHD MTSTASEPRA TTAAASPLPT AASSPAESPS PGVTAAGGAR ATAESWPDLI 
TDLIAGQALA ADRTAWAMEQ IMGGLATPSQ IAGFVVALRA KGETAQEIGG LVRTMLGFAE
PLTLSEELRA AAVDTCGTGG DRSNTVNLST MAAIVAAGAG VTVVKHGNRA ASSASGSADV
LAELGVVIDL PPAGVEACLA AAGIAFCFAP VFHPAMRHVG ATRKELGVQT AFNILGPLAN
PARPGAQTIG VADARLAPVV ADVLAERGTR GLVFRGDDGL DELTTATTST VWVVQAPDPT
SSRTPGSTAG SVAGSASEAA AVRRSRVRSE HFDPRDLGLA RPDTTALRGA DAAYNASVAR
AMLRGETGPV RDAVLLAAAA TLVAVDGPTD APVAEQIAAQ LGRATEAVDS GAAAAALSRW
AEASQLAATA RG
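
Both sequences above are printed in blocks of ten characters, so the spacing has to be removed before the sequence can be measured or compared with the listed length and GC content. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the example input is only the first line of the gene sequence, so its GC percentage is not expected to match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used as an example input.
first_line = "ATGCGAACAA CGACTGCAGA CGGACACGAC ATGACGAGCA CCGCCTCGGA GCCGCGGGCG"
dna = clean_sequence(first_line)

print(len(dna), round(gc_percent(dna), 1))
```

Pasting the full gene sequence in place of `first_line` gives values that can be compared with the Gene Length and GC content fields.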