Gene Franean1_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4454 
Symbol 
ID5672805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5320990 
End bp5322171 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID641243322 
Productamidohydrolase 2 
Protein accessionYP_001508738 
Protein GI158316230 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACTGA TGGATCCGGC GCTGTTAGAC GAAATCAAGA TCATTGACAC GGACACCCAT 
GTGGTGGAGC CACCGGATCT GTGGACCTCA CGGGTGTCGG TGCGCAAGTG GGGTTCGCTG
GTGCCGCACA TCCGGCCGGA CGCGTCGGGT GACCCGGCGT GGTTCGTCGG GGACCAGCGG
ATGTTGGGAG TGGCGGCGGC GGCGATGGCG GGCTGGCACG AGTACCAGCC GGATCATCCG
CTGCGGCTGG AGGACGCGGA CCGGGCGGCG TGGGATCCGG CGGCGCGACT CAAGCGGATG
GACGAGGACG GCATCCACGC GCAGGTCCTC TATCCAAACG TGGCGGGCTT CGGCGGGGGC
AACTTCACGA AGGTCGAAAA TCCGGATCTG ATGCTGGATC TGGTGCGCGC CTACAACGAC
TTCCTGACAG ATTTCGCCGG TGTCGCGCCG GGCCGTTACA TCCCGATCAG CGCCGTTCCG
TTCTGGGACC TGGAGCTGGC GGCCAAGGAG ATCGACCGGG TCGCGGCGGC CGGGCACAAG
GGTCTGATCA TGACGGCGGC GCCCGAGAAC TGGGGCCAGC CGTTCCTGGA GGACCCTCAC
TGGGACCCGC TCTGGGCGAA GGCCCAGGAG GTCGGCCTGC CGATCAACTT CCACATCGGG
TCGGGGGACA TCTCCGCGTA CCCGACGCAC CCCGGTGGGA AGCACGCCAA CTCGGCTTCG
CTCGCGGTCC TCAACTTCAT GGGTAACGCG GCGGCCATCG TCCGGGTGAT CTGTGGCGGT
ATCTGCCACC GGTTCCCGGA GCTGAACATC GTCTCGGTGG AAAGCGGCGT GGGCTGGATC
CCGTTCGCGC TCGAGGCGCT GGACTGGCAG TGGTACAACT GCGGTGTTCC GCAGGAGCAC
CCGGAGTACG AGCTGTCGCC GAAGGAGTAC TTCCTGCGGC AGGTCTACGG GTGTTTCTGG
TTCGAGCGGG ACACCGCGAT GAGCGCGATC AGCCAGGTCG GGGCGCGGAA CTTCATGTAC
GAGACGGATT TCCCGCACCC GACGAGCATG ACGCCCGGCC CGGCGTCGAT CGCGACGACG
CCGCGGGAGT ACCTGCTCGC CGCGATGGCC GATCTGCCGG ACGAGACCGT GCGACTGCTG
TTGCAGGACA ACGCCGCCCG CATCTACCAT CTCGATCTCT GA
 
Protein sequence
MRLMDPALLD EIKIIDTDTH VVEPPDLWTS RVSVRKWGSL VPHIRPDASG DPAWFVGDQR 
MLGVAAAAMA GWHEYQPDHP LRLEDADRAA WDPAARLKRM DEDGIHAQVL YPNVAGFGGG
NFTKVENPDL MLDLVRAYND FLTDFAGVAP GRYIPISAVP FWDLELAAKE IDRVAAAGHK
GLIMTAAPEN WGQPFLEDPH WDPLWAKAQE VGLPINFHIG SGDISAYPTH PGGKHANSAS
LAVLNFMGNA AAIVRVICGG ICHRFPELNI VSVESGVGWI PFALEALDWQ WYNCGVPQEH
PEYELSPKEY FLRQVYGCFW FERDTAMSAI SQVGARNFMY ETDFPHPTSM TPGPASIATT
PREYLLAAMA DLPDETVRLL LQDNAARIYH LDL