Gene Franean1_0246 details

Gene Information

Locus tag: Franean1_0246
Symbol:
ID: 5668671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 300704
End bp: 301759
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 75%
IMG OID: 641239176
Product: short chain dehydrogenase
Protein accession: YP_001504619
Protein GI: 158312111
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
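
The start, end, gene length, and protein length fields in this record are mutually consistent, and a short Python check can illustrate the arithmetic. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(300704, 301759)
print(length_bp, protein_length(length_bp))  # prints: 1056 351
```

Both values agree with the Gene Length and Protein Length fields listed in this record.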


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.153805
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.728482
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCGC AGGCGACACC ACCCGCGACG GCCCGCACGC CGGCCGCGGC GGCCCGCACG 
CCCTGGGCTC CCGACAGGAG CGCCCTTCGC GGGCGCGTGG CCGTCGTCGC CGGCGCCACC
CGCGGCGCGG GTCGCGGGAT CGCGGCGGCG CTCGGTGAGG CCGGCGCCAC CGTCATCTGC
ACCGGCCGCA GCAGCAGGAC GGGCGTCCTG CGCTCCGACT ACGACCGCGC CGAGACGATC
GAGGAGACCG CGGAGCTCGT CACCAAGCTC GGTGGTGCCG GCATCGCCGT CCCCGTCGAC
CATCTGGACC CGGAGCAGGT GCGACGACTG GCCGACCGCG TCCGCGCCGA GCACGGGCAC
CTCGACGTGC TCGTCAACGA CATCTGGGGC GGCGAGGTCC TCAAGGGCGG GCCGAGCGAG
TGGGACACGC CGGTCTGGGA GCACGACCTC GACCGCGGGA TGCGCATCCT GCGGCTCGCC
GTGGACACCC ACCTGATCAC CTCCCACCAC CTGCTCCCGC TGCTGATCGA CCGCCCAGGC
GGGCTGGTCG TCGAGGTGAC CGACGGGACG ACGGACTACA ACGCGGCCAA CTACCGGATC
TCCGTGTACT ACGACCTCGC CAAGGTCGCC GTGAACCGGC TCGCCTTCTC ACAGGGCCAC
GAGATCGCCT CGCACGGCGG GACCGCCGTC GCCGTCACGC CCGGCTGGCT GCGCTCGGAG
ATGATGCTCG AGGCGTTCGG CGTCACCGAG GAGACCTGGC GCGACGCCCT CACCCCGCCC
GACAGCGCCT CGCCCAGCGG TGCTCCGCCC AGCGGTGCTC CGCCGGACTT CGCGTTCTCC
GAGTCGCCGC GTTATGTCGG GCGCGCCGTC GCCGCGCTGG CCGCCGATCC GGGACGGGCC
CGCTGGAACC AGCGCTCGGT GACCTCCGGC CGGCTCGCGG CCGAGTACGG CTTCACCGAC
GTCGACGGGT CACAACCGGA CATCTGGCCG CGCCTGGAAC GTCCGGCGGA GGCCCCGGCG
CCGGCGGCCG GTCAGGAGCC GCACCCGCGC CGGTAG
 
Protein sequence
MSAQATPPAT ARTPAAAART PWAPDRSALR GRVAVVAGAT RGAGRGIAAA LGEAGATVIC 
TGRSSRTGVL RSDYDRAETI EETAELVTKL GGAGIAVPVD HLDPEQVRRL ADRVRAEHGH
LDVLVNDIWG GEVLKGGPSE WDTPVWEHDL DRGMRILRLA VDTHLITSHH LLPLLIDRPG
GLVVEVTDGT TDYNAANYRI SVYYDLAKVA VNRLAFSQGH EIASHGGTAV AVTPGWLRSE
MMLEAFGVTE ETWRDALTPP DSASPSGAPP SGAPPDFAFS ESPRYVGRAV AALAADPGRA
RWNQRSVTSG RLAAEYGFTD VDGSQPDIWP RLERPAEAPA PAAGQEPHPR R
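
The relationship between the gene sequence and the protein sequence can be reproduced with a few lines of Python. The sketch below is an illustration rather than the pipeline that produced this record: it builds a codon table from the amino-acid assignments that translation table 11 shares with the standard genetic code, translates up to the first stop codon, and computes a GC fraction. It does not handle alternative start codons (not needed here, since the gene starts with ATG), and the helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI codon order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence from this record.
first_row = "ATGTCCGCGC AGGCGACACC ACCCGCGACG GCCCGCACGC CGGCCGCGGC GGCCCGCACG"
print(translate(first_row))  # prints: MSAQATPPATARTPAAAART
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.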