Gene Franean1_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0094 
Symbol 
ID5668519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp111434 
End bp112696 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content67% 
IMG OID641239022 
Productamidohydrolase 2 
Protein accessionYP_001504467 
Protein GI158311959 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.986217 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCG ACGACATGGT CTTGGTGAGC ATCGACGACC ACGTGGTCGA GCCGCCCGAC 
ATGTTCAAGA ACCACGTCCC GGCGAACCTG GTGGACCAGG CGCCGCACGT CGTGCGCAAC
GACAAGGGCG TGGACCAGTG GATCTACCAG GGCCGGGTGA CGGGCGTCAG TGGCCTGAAC
GCGGTCGTGT CGTGGCCGGC GGAGGAGTGG GGCAAGGACC CGGCCGGCTT CGCCGAGATG
CGCCCGGGGG TGTACGACAT CCACGACCGG GTCCGGGACA TGGACCGTAA CGGGATCCTG
GCGTCGATGT GCTTCCCGAC GTTCGCCGGG TTCTCCGCCG GGCATCTCAA CCACTACAAG
ACCGACACCA CGGTCACGAT GGTCCAGGCG TACAACAACT GGCACATCGA CGAGTGGGCC
GGCACCTACC CGGGCCGGTT CATCCCGCTG GCGCTGCTCC CGACCTGGGA CCCGCAGCTG
ATGGTGAACG AGATCCGCCG GGTGGCGGCG AAGGGCTGCC GGGCGGTCAC CATGCCCGAG
CTGCCGCACC TCGAGGGCCT GCCCAGCTAC CACAACCTCG ACTTCTGGGC TCCGGTGTTC
GAGGCGCTGT CCGACACCGG GATGGTGATG TGCCTGCACA TCGGAACCGG GTTCGGCGCG
CTCAAGCTCG CCCCGGACGC GCCGATCGAC AACCTGATCA TCCTGGCGTG CCAGATCTCC
TCGCTGGCCG TGCAGGACCT GTTGTGGGGC CCGGCGATGC GGACCTACCC GGACCTGAAG
TTCGCCTTCT CCGAGGGCGG CATCGGCTGG ATCCCGTTCT ACCTGGACCG CTGCGACCGG
CACTACACCA ACCAGCGCTG GCTGCGCCGC GACTTCGGCG GCAAGCTGCC CAGCGAGGTG
TTCCGCGACC ACTCGCTCGC CTGCTACGTC ACCGACCCGA CGTCGCTGAA GCTGCGCCGT
GAGATCGGGA TCGACATCAT CGCCTGGGAG TGCGACTACC CGCACTCGGA CTCGATCTGG
CCGGACGCGC CGGAGTTCGT GCTCAACGAG CTGAACAACG CGGGTGCGAC CGACGAGGAG
ATCGACAAGA TCACCTGGCA GAACGCCTGC CGGTTCTTCA ACTGGGACCC GTTCTCCGAG
ATCCCCAAGG AGCGCGCGAC CGTCGGCGCC CGCCGGGCCA TCGCCACCGA CGTCGACACC
GCCATCCGCT CCCGCAAGGA ATGGGCCCGC CTCTTCGCGG AGAAGCACCC CGAGACCATC
TGA
 
Protein sequence
MNIDDMVLVS IDDHVVEPPD MFKNHVPANL VDQAPHVVRN DKGVDQWIYQ GRVTGVSGLN 
AVVSWPAEEW GKDPAGFAEM RPGVYDIHDR VRDMDRNGIL ASMCFPTFAG FSAGHLNHYK
TDTTVTMVQA YNNWHIDEWA GTYPGRFIPL ALLPTWDPQL MVNEIRRVAA KGCRAVTMPE
LPHLEGLPSY HNLDFWAPVF EALSDTGMVM CLHIGTGFGA LKLAPDAPID NLIILACQIS
SLAVQDLLWG PAMRTYPDLK FAFSEGGIGW IPFYLDRCDR HYTNQRWLRR DFGGKLPSEV
FRDHSLACYV TDPTSLKLRR EIGIDIIAWE CDYPHSDSIW PDAPEFVLNE LNNAGATDEE
IDKITWQNAC RFFNWDPFSE IPKERATVGA RRAIATDVDT AIRSRKEWAR LFAEKHPETI