Gene Franean1_6663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6663 
Symbol 
ID5674978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8092844 
End bp8093845 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content75% 
IMG OID641245514 
ProductNitrilase/cyanide hydratase and apolipoprotein N-acyltransferase 
Protein accessionYP_001510906 
Protein GI158318398 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00315162 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.834445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGG ACCGCGGGGA CGGCGCGCGT GGGGGGCCGG AGCCGGCCTC CCACGCGGTG 
ACGATCGCCG CGGTGACCGC GCCGTTCACC CGTGACCTGG ACGAATGCCT CGCGGCGATC
AGTCGGCTGG TCGACGGGGC CCGGCGGCGC GGGGTGGATC TCCTCGTCCT GCCCGAGGGG
GCGCTCGGCG GCTACCTGCG GGCGCTGCCC CCGCGCGGGG ACGATCTGGC CCCGCGCGGG
GGGCCGCCCG CCCTCGATCC GGACGGTCCG GAGATCACCC GGCTGGCGGC GATCGCCGGG
GATATGGTGG TCTGCGCGGG ATACGCGGAG CGTGACGGGC GGTACCGGTA CAACAGTGCG
GTGTGTGTGC ACGGTGATGG CGTCCTCGGC CGGCATCGCA AGGTCCACCA GCCGCTCGGC
GAGTCCCTCG CCTACGAGGC CGGCCGTTCC TTCACCGCGT TCGACAGCCC ACTCGGCCGG
ATGGGGATGA TGATCTGTTA CGACAAGGCG TTCCCCGAGT CCGGGCGTAG CCTGGCGCTC
GCTGGCGCGG ACATCATCGC CTGCCTGTCG GCCTGGCCGG CCTCGCGTAC CCACGCGGCC
GACGACATCG CCGCGGACAG GTGGCGGCAC CGCTTCGACC TCTACGACCA GGTGCGCGCA
TTGGAGAACC AGGTCGTGTG GGTCTCGTCC AACCAGGCCG GCACGTTCGG CTCGCTGCGC
TTCGTCGGCA ACGCGAAGAT CGTCCATCCG GACGGCTCCG TGCTCGCCTC GACCGGCACG
GGCGCGGGGA TGGCCGTCGC GACGGTCGAC GTCACCGCGG CGCTACGGGC GGCCCGCAGC
GGCCTGAACC ACCTGCGGGA CCGCCGCGGC GCGAGCTACG AGAAGCAGTG CCTGCTCGCC
GGTAAGCCCT ACGACCCGCG GCGGGCCGCC CGCCCGGCGG CCCCCCGGCC CACGGCCGAG
CACGGATCTC GCCCGTCCCG GCCCGCGGGA CCGGCGCACT GA
 
Protein sequence
MSADRGDGAR GGPEPASHAV TIAAVTAPFT RDLDECLAAI SRLVDGARRR GVDLLVLPEG 
ALGGYLRALP PRGDDLAPRG GPPALDPDGP EITRLAAIAG DMVVCAGYAE RDGRYRYNSA
VCVHGDGVLG RHRKVHQPLG ESLAYEAGRS FTAFDSPLGR MGMMICYDKA FPESGRSLAL
AGADIIACLS AWPASRTHAA DDIAADRWRH RFDLYDQVRA LENQVVWVSS NQAGTFGSLR
FVGNAKIVHP DGSVLASTGT GAGMAVATVD VTAALRAARS GLNHLRDRRG ASYEKQCLLA
GKPYDPRRAA RPAAPRPTAE HGSRPSRPAG PAH