Gene Francci3_2516 details


Gene Information

Locus tag: Francci3_2516
Symbol:
ID: 3904660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2972892
End bp: 2973929
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 61%
IMG OID: 637879846
Product: amidohydrolase 2
Protein accession: YP_481612
Protein GI: 86741212
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
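
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2972892, 2973929)
    print(length)                  # 1038
    print(protein_length(length))  # 345
```

Both values agree with the Gene Length and Protein Length fields of this record.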


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.125947
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGTCA AGGACAATGA GCGATACTTC ATCGTGGATT CCCACCTGCA CTTCTGGGAT 
GGGAGCCCGG AAAACCAGGC GAACCGCTAC GGCAAAGGTT TCATCGACTG CTTCTACGAT
TACCATGTGA ATCTGAGCCC GCAGGAGTAC CTCTGGCCGC GGGAAAAGTT CCAGAAGTAC
TCTGCGGAAG TCATGGTGAA GGACCTGTTC GAGGACGGTT ACGTCGACAA GGGGATCTTC
CAGCCCACTT ATCTGACGGA CTTCTACCGG AATGGTTTCA ACACCACCGA GCAGGACGGC
GCGCTCGCCG AGCGGTACCC CGGCAAGTTC ATCGTGAACG GCGCCTTCGA CCCGCGTGAC
GGCGAACTGG GCCTGTCGAA GCTGGCGGAC CTGGCGGCAC GCTGGAACCT CAAGGGTGTG
AAGCTCTACA CGGCGGAGTG GAAGGGCGAG TCCAAGGGCT ACAAGCTGAC CGACCCGTGG
GTCTACCGGT ATCTGGAGAA GTGCCAGGAA CTCGGCATCC GCAACATCCA CATCCACAAG
GGCCCGACGA TCTACCCGCT GAACCGGGAC GCGTTCGATG TCGCCGACGT CGATGATGTG
GCCACCGAAT TCCCCGAGCT GCGGTTCATC ATCGAACACG TCGGACTGCC CCGGTTGGAG
GACTTCTGTT GGATCGCCAC GCAGGAGCCC AATGTCTACG GTGGGCTCGC GGTGGCCATG
CCGTTCATCC ACAGCCGGCC GCGCTACTTC GCGCAGATCA TCGGTGAGCT CCTCTACTGG
CTCGACGAGA ACCGGCTGAC CTTCTCGAGT GACTACGCGA TCTGGCACCC CAAGTGGCTG
GTCGAGAAGT TCGTCGACTT CCAGATCCCG GCGGACATGC AGGCCGAGTA CGGCGTGCTC
ACCCCCGACA TCAAGCGCAA AATTCTCGGT CTCAACGCGG CCGCGCTCTA CGACATCGAG
GTACCGGCCG AGGTTAGCGG GGCGGGCAGC GGTTCTCCGG CGTCGACGCC TCTCGTGGGC
GCAGGGCAGT CCGTATGA
 
Protein sequence
MYVKDNERYF IVDSHLHFWD GSPENQANRY GKGFIDCFYD YHVNLSPQEY LWPREKFQKY 
SAEVMVKDLF EDGYVDKGIF QPTYLTDFYR NGFNTTEQDG ALAERYPGKF IVNGAFDPRD
GELGLSKLAD LAARWNLKGV KLYTAEWKGE SKGYKLTDPW VYRYLEKCQE LGIRNIHIHK
GPTIYPLNRD AFDVADVDDV ATEFPELRFI IEHVGLPRLE DFCWIATQEP NVYGGLAVAM
PFIHSRPRYF AQIIGELLYW LDENRLTFSS DYAIWHPKWL VEKFVDFQIP ADMQAEYGVL
TPDIKRKILG LNAAALYDIE VPAEVSGAGS GSPASTPLVG AGQSV
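
The relationship between the gene sequence and the protein sequence can be reproduced with a short standard-library sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which for non-start codons are the same as the standard code. It does not handle alternative start codons (the gene here starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence listed above.
    print(translate("ATGTACGTCA AGGACAATGA GCGATACTTC"))  # MYVKDNERYF
    print(translate("GCAGGGCAGT CCGTATGA"))               # AGQSV
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow the complete protein sequence and the GC content field to be checked against the record.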