Gene Francci3_1242 details

Gene Information

Locus tag: Francci3_1242
Symbol:
ID: 3903541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1486473
End bp: 1487687
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 72%
IMG OID: 637878576
Product: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
Protein accession: YP_480349
Protein GI: 86739949
COG category: [R] General function prediction only
COG ID: [COG0388] Predicted amidohydrolase
TIGRFAM ID: [TIGR03618] PPOX class probable F420-dependent enzyme
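
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
start_bp = 1486473
end_bp = 1487687
gene_length_bp = 1215
protein_length_aa = 404


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the span 1486473 to 1487687 gives 1215 bp, and 1215 / 3 = 405 codons, one of which is the stop codon, leaving 404 amino acids.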


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.194401
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCCC CGGTCCGTGC CGCCGTCCTG CAGCTCGGCT GTCCCGACGA GGAGAACGCG 
GCGGACCGGG TGCGCAGGGT GCTGGGGGAG ATCCGGCAGA CCCAGGCGGA CCTGGTCGTC
CTCCCCGAAC TATGGGTCAC CGGCTATTTT CACTTCGATC GGTACGAGGC GGAGGCGGAG
GCGTTGACCG GGCCGACGGT CACCGCCCTG CGGGAGGCCG CCCGGGAACG CGGATGCCAC
CTGGTGGCCG GCAGCATCGT GGAACGGTCC GCCGACGGAC GGCTTTTCAA CACGACCGTG
TTGATCGGTC CGGACGGGAT GATCCGGCAT GCGTACCGCA AGGTGCACCT GTTCGGCTAC
GGCTCCGCGG AGGCCCGGTT GCTGACGCCG GGTGCCACGG TCGGAACCGT CCCCACCGAA
CTCGGCATCG TCGGGCTGGC GACCTGTTAC GACCTCCGGT TTCCCGAGCT GTTCCGGCTG
CTGGCCGAGG GCGGTGCCGA GATCGTGGTG GTGGTATCGG CCTGGCCGTT GGCCAGGCTG
GACCACTGGC GCGTGCTCAC CCGGACCCGG GCCATCGAGA ACCAGGTGTA TCTCGTCGCC
TGCAACGCCG CGGGCCGCCA GGCGGGCCGG GAGATGGCCG GGGCGAGCGT CGTCGTCGAT
CCGTGGGGAG AGGTGCTCGC CGAGGCCGGC CCGCGGCCCA CCACGGTGCG CGCGGAGCTG
GACCCGTCCC GACCGGCCGC CGTTCGTGCC GAGTTCCCGG TGCTCACGCA TCGCCGGCTC
GGGGTCGACC ACCAGAACCG GCTTGACCCC GCGCATCTGC TCGACCTGGC CGGGCGTGGC
AAGGCCTTTC TTGACTTCTG GCGAGAGCGG CACCTGTGCA GCCTGACCAC CGTCCGGCCG
GATGGCAGCC CGCACGTGGT CGCCGTCGGC GCGACACTTG ATCCGGCGGC GGGCATCGCC
CGGGTGATCA CATCGGCCCG GTCCCGCAAG GCCCGTCTGA TCGCGGCGGG ACCGGCCCAC
GGCACCCCCG TCGGGCTCTG CCAGATCGAC GGCCGACGCT GGTCGACGCT GGAGGGACTG
GCCGTGCTCC GGGATGATCC GGCGAGTGTG GCCGACGCCG AGTTCCGCTA TGCGCAGCGT
TACCGCGCGC CGCGGGCCAA TCCCCGGCGT GTCGTCGTGG AGGTGCGGAT CACCCGCGTG
CTCGGCAGCG TCTGA
 
Protein sequence
MTAPVRAAVL QLGCPDEENA ADRVRRVLGE IRQTQADLVV LPELWVTGYF HFDRYEAEAE 
ALTGPTVTAL REAARERGCH LVAGSIVERS ADGRLFNTTV LIGPDGMIRH AYRKVHLFGY
GSAEARLLTP GATVGTVPTE LGIVGLATCY DLRFPELFRL LAEGGAEIVV VVSAWPLARL
DHWRVLTRTR AIENQVYLVA CNAAGRQAGR EMAGASVVVD PWGEVLAEAG PRPTTVRAEL
DPSRPAAVRA EFPVLTHRRL GVDHQNRLDP AHLLDLAGRG KAFLDFWRER HLCSLTTVRP
DGSPHVVAVG ATLDPAAGIA RVITSARSRK ARLIAAGPAH GTPVGLCQID GRRWSTLEGL
AVLRDDPASV ADAEFRYAQR YRAPRANPRR VVVEVRITRV LGSV
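
The relationship between the two sequences can be illustrated by translating the first row of the gene sequence and comparing it with the start of the protein sequence. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring whitespace; stop at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first two blocks of the protein sequence,
# copied from the record above.
first_60_nt = "ATGACAGCCC CGGTCCGTGC CGCCGTCCTG CAGCTCGGCT GTCCCGACGA GGAGAACGCG"
first_20_aa = "MTAPVRAAVL QLGCPDEENA".replace(" ", "")

print(translate(first_60_nt) == first_20_aa)
```

The same function can be applied to the full gene sequence; the spaces between the 10-base blocks are removed before translation.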