Gene Franean1_2912 details

Gene Information

Locus tag: Franean1_2912
Symbol:
ID: 5671299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3428394
End bp: 3429659
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 64%
IMG OID: 641241819
Product: amidohydrolase 2
Protein accession: YP_001507239
Protein GI: 158314731
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
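
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted as a residue; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3428394
end_bp = 3429659
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)  # 1266 422 421
```

Under these assumptions the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.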


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTAG AGGAAATGAT CCTGGTTAGC ATCGACGACC ACGTCATCGA GCCGCCGGAC 
ATGTACAAGA ACCACGTCCC GGCCAGATGG CGCGACGAGG TTCCGAAGGT CGTGCGGAAC
AGGACGGGCA TCGACGAGTG GGTGTTCCAG GGGCAGAAGA CCTCTACCCC GTTCGGGATG
GCGGCGACCG TGGGCTGGCC CCGGGAGGAA TGGGGCTTCA ACCCCGGGGC GTTCGCCGAG
CTGCGGCCGG GCTGTTTCGA TGTGCGCCAA CGGGTACAGG ACATGAACGC CAACGGCGTC
TTGGCCTCGA TGTGCTTCCC GACGATGGCC GGCTTCAACG CTCGCACATT CACCGAAGCG
ACCGACAAGG AACTGTCGCT CGTCATGCTC CAAGCCTACA ACGACTGGCA CATCGATGAA
TGGTGCGCCA CGTGCCCCGG CCGGTTCATC CCGCTCGGCA TCGTGCCGAT GTGGGATGTC
GGCCTGGCCG TTAAAGAAAT AAAGCGGATC GCGAGGAATG GATGCCGCGC CGTCGGCTTC
CTGGAGGCGC CCCACTCCCA GGGCTGGCCA AGTTTCCTTT CCGGCCACTG GGACCCGATG
CTTCAAGCAC TCGTGGAGGA GAACATGGTC CTCTGCCTGC ACATCGGCGG GGCACGGGAC
CTTGTCACGA CCGCACCCGA GGCGCCGGTC GACCACATGG TCATCATCCC CTCTCAGCTC
ACCATTCTCA CCGCGCAGGA CCTTCTGTTC GGACCAACAC TGCGACGCTT TCCCACTCTC
AAGGTCGCCC TGTCCGAAGG TGGCATCGGC TGGATCCTAT TCTACCTTGA CCGGGTCGAC
CGCCACGTCA CCAACCAGAC TTGGATCCAC AACGACTTCG GCGGCAGACT TCCGTCCGAG
GTGTTCCGCG AGCACTTCCT GGCCTGCTAC ATCACCGACC CCGCCGGCCT CGAGCTCCGC
CACCGGATCG GCCTCGACAC CATCGCCTGG GAGTGCGACT ACCCGCACAC CGACACGACC
TGGCCGGAGT CTCCCGAGAC GGCCTGGAAC GAACTGCAGG GAGCCGGCTG CACCGACGAG
GAGATCAATG AAATCACCTG GGAGAACGCC AGCATATTCT TTGGCTGGGA CCCATTCACC
CATATCCCCC AGGATCAGGC CAGCGTCGGC GCTCTGCGCG CACTCGCCAC AGACGTGGAC
CTCACCAGAA TGCCACGCGA GGAGTGGCGC AGGCGTAACG AGGCGGCCGG GATAGGCGCC
TTCTGA
 
Protein sequence
MQLEEMILVS IDDHVIEPPD MYKNHVPARW RDEVPKVVRN RTGIDEWVFQ GQKTSTPFGM 
AATVGWPREE WGFNPGAFAE LRPGCFDVRQ RVQDMNANGV LASMCFPTMA GFNARTFTEA
TDKELSLVML QAYNDWHIDE WCATCPGRFI PLGIVPMWDV GLAVKEIKRI ARNGCRAVGF
LEAPHSQGWP SFLSGHWDPM LQALVEENMV LCLHIGGARD LVTTAPEAPV DHMVIIPSQL
TILTAQDLLF GPTLRRFPTL KVALSEGGIG WILFYLDRVD RHVTNQTWIH NDFGGRLPSE
VFREHFLACY ITDPAGLELR HRIGLDTIAW ECDYPHTDTT WPESPETAWN ELQGAGCTDE
EINEITWENA SIFFGWDPFT HIPQDQASVG ALRALATDVD LTRMPREEWR RRNEAAGIGA
F
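
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen for illustration; `clean` removes the spaces and line breaks used in the 10-base display blocks.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G; third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence above.
first_blocks = "ATGCAGCTAG AGGAAATGAT CCTGGTTAGC"
print(translate(first_blocks))  # MQLEEMILVS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be one way to compare against the listed protein sequence and GC content.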