Gene Franean1_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0638 
Symbol 
ID5669055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp742461 
End bp743669 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID641239565 
Productamidohydrolase 2 
Protein accessionYP_001505003 
Protein GI158312495 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.477012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.622567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGGC CGAACTGGCA GCTGCTTCCG GACCCAGAGC CGCAGGAACG CAGCTACTCG 
ATCGTCTCGG TGGACGATCA TCTGACCGAG CCTGCCGACA TCTTCGTCAA GCGGTTCCCG
GCGAACCTTC GTGACAAGGC ACCTCAGGTG ATCACGACAC CAGACGGTTC GGAGGCCTGG
GTCTACCGCG ACCGGCTCTA CCGTGACAAC GGAATGAGTG TCGTCGCCGG GCGCCCGCAG
TCCGAGTGGA ACCTCGACCC GCTGAACTTC AGCGAGATGC GCCGGTCCGC GTGGGACGTC
CATGCCCGCG TGAAGGACAT GGACCTCGAC GGCATCTGGG CGTCACTGTG CTTCCCCTCC
GGGGCGTGGG GCTTCACCGG CCGCGTGCTG TCGATGAACA ACGACCAGGA GGTCGGGCTC
GCCGCGGTCC GTGCCTGGAA CAGCTGGATG ATCGAGGAGT GGCACGGGGC GTACCCGGAG
CGCTTCATCC CGATGCAGCT GCCCTGGTTC AAGGACCCCG AGGTCGCCGC CGAGGAGATC
CGCCGCAACG CCGAGCTCGG CTTCACCTCG GTGTCGTTCC TCGAGTCGCC GCACCTGCTC
AAGCTTCCGC CGATCACCAA CCACAAGCAC TGGGAGCCGT TCTTCAAGGC GTGCGAGGAG
ACCGACACGG TCATTTCGCT GCACTGCGGC GCGAGCGGCT TCGTCCTGCA GGGCTCGCCG
GGCGGCGGCC TGAACGTGCA GACGTCGCTC TTCCCGGCGG GCGCGTTCTG CGCGGCCGTG
GACTGGGTGT GGGCGGGCAT CCCGGCGCTC TACCCGAACC TCAGGATCGC GCTGAGCGAG
GGTGGCATCG GCTGGGTGCC GATGGCGATC AACCGCCTCG ACTACGTGCT CGAGCACTCC
GGCAGCGGCG GCACGCCGTG GACGTACGAC GTGACGCCGA GCGAGGCGCT GCGCCGGAAC
TTCTACTTCT GCATGCTGGA CGACCCCGGC ACGCTCGACC AACGTCACAT GATCGGGATC
GACCACATCC TGTTCGAGAC GGACTTCCCG CACGCCGACT CGACCTGGCC GGGCTCGCAG
GACCTGCTGC GCAAGCGTTT CGCCGACATC CCTCGGCACG AGGCCGTGAT GATCGCCGGT
GGCAACGCCG CGAGGCTGTT CCGGCACCCG CTCCCGCAGG GCGGCGACTG GCCGGCGATC
ACCCCGTAG
 
Protein sequence
MPRPNWQLLP DPEPQERSYS IVSVDDHLTE PADIFVKRFP ANLRDKAPQV ITTPDGSEAW 
VYRDRLYRDN GMSVVAGRPQ SEWNLDPLNF SEMRRSAWDV HARVKDMDLD GIWASLCFPS
GAWGFTGRVL SMNNDQEVGL AAVRAWNSWM IEEWHGAYPE RFIPMQLPWF KDPEVAAEEI
RRNAELGFTS VSFLESPHLL KLPPITNHKH WEPFFKACEE TDTVISLHCG ASGFVLQGSP
GGGLNVQTSL FPAGAFCAAV DWVWAGIPAL YPNLRIALSE GGIGWVPMAI NRLDYVLEHS
GSGGTPWTYD VTPSEALRRN FYFCMLDDPG TLDQRHMIGI DHILFETDFP HADSTWPGSQ
DLLRKRFADI PRHEAVMIAG GNAARLFRHP LPQGGDWPAI TP