Gene Franean1_3331 details

Gene Information

Locus tag: Franean1_3331
Symbol:
ID: 5671703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3943078
End bp: 3944253
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 67%
IMG OID: 641242220
Product: amidohydrolase 2
Protein accession: YP_001507640
Protein GI: 158315132
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
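
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3943078
end_bp = 3944253
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as an amino acid.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1176 0 391
```

Under these assumptions the reported gene length (1176 bp) and protein length (391 aa) are consistent with the start and end coordinates.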


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.550336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCAAGC TGGCTGACGG AATTCGAGTG GTCGACGCCG ACGCGCACAT GACCGAGCGC 
CACGACCTGT TCACCGAGAG GGCGCCGAAG GGCTACGAGG ACAGGGTCCC GCACGTCGAG
CGGATCGACG GCGTCGACAT GTGGATCGTC GAGGGCAAGG CGTTCGGCAA GGCAGGCTCC
GGCGGCACCG TCGACCACGA CGGGAAGAAG CACCCGTTCC GGGACTCCCA AGGCGGGTCC
TGGGGCATCA ACGACGTGCA CCCCGCGGCG TGGGACCCGA AGGAGCGCCT GCGCCTGATG
GATGAGCTCG GCATCCACAC GCAGGTCCTC TACCCCAACG CGATCGGCAT CGGCGGCCAG
AACCTGCGGA ATTCGGTCCA GGACCCGATC GTTCTCCGGC TCTGCGTCGA GCTCTACAAC
GACGCGATGG CGGAGGTCCA GGCGGAGTCG GGCAACCGGC TGCTCCCGAT GCCGATCATG
CCCGCGTGGG ACGTCGAGGC CTGCGTCCGG GAGGCCGAGC GCTGCGCCGC CCTGGGCTAC
CGCGGGGTCA ACATGACCGC CGACCCGCAG GACTCCGGCT CACCCGACCT GGGCGACACC
GCCTGGGACC CGTTCTGGGA GGTCTGCGCC GGGAACAAGC TCCCCGTGCA CTTCCACATC
GGGGCGAGCC AGACGGCGCT GTCCTACTTC GGCACGACCT ACTGGCCCAG CCAGGACGAC
TACGTGAAGC CGGCGATCGG CGGCGCGTCG CTGTTCCAGA ACAACTCCCG GGTACTGCTC
AACAGCGCCT ACTCCGGGAT GTTCGACCGT CACCCCGACC TGAAGATGGT CTCGGTCGAA
AGCGGCATCG GCTGGGTGCC GTTCATGCTC GAGGCGATGG ACTACGAGCT TGAGGAGAAC
GCACCGGAGT ACTTTCACAA GCTGCAGAAG CGGCCGTCGG AGTACTTCGC GTCGAACTGG
TACGCGACCT TCTGGTTCGA GAAGGGCCGC GGCGACCTCC AGCACCTCAT CGACACCGTC
GGCGAGGACA ACATCATGTT CGAGACGGAC TTCCCGCACC CGACCTGCCT GCACCCGGAC
CCCCTCGGAA TCGTTGGCGA GACGATCGCC TCGCTGCGTC CCGAGACGCA GCGGAAGGTC
ATGGGCGGCA ACGCGGTCAA GCTCTACCGC GTCTGA
 
Protein sequence
MVKLADGIRV VDADAHMTER HDLFTERAPK GYEDRVPHVE RIDGVDMWIV EGKAFGKAGS 
GGTVDHDGKK HPFRDSQGGS WGINDVHPAA WDPKERLRLM DELGIHTQVL YPNAIGIGGQ
NLRNSVQDPI VLRLCVELYN DAMAEVQAES GNRLLPMPIM PAWDVEACVR EAERCAALGY
RGVNMTADPQ DSGSPDLGDT AWDPFWEVCA GNKLPVHFHI GASQTALSYF GTTYWPSQDD
YVKPAIGGAS LFQNNSRVLL NSAYSGMFDR HPDLKMVSVE SGIGWVPFML EAMDYELEEN
APEYFHKLQK RPSEYFASNW YATFWFEKGR GDLQHLIDTV GEDNIMFETD FPHPTCLHPD
PLGIVGETIA SLRPETQRKV MGGNAVKLYR V
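
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its set of permitted start codons (which includes GTG), and that a start codon in the first position is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating start codon is read as methionine, whatever it normally encodes.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGGTCAAGC TGGCTGACGG AATTCGAGTG"))  # prints: MVKLADGIRV
```

The output matches the first block of the protein sequence: the leading GTG codon, which normally encodes valine, is read as the initiating methionine.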