Gene Franean1_3356 details

Gene Information

Locus tag: Franean1_3356
Symbol:
ID: 5671727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3973682
End bp: 3975316
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 70%
IMG OID: 641242244
Product: cholesterol oxidase
Protein accession: YP_001507664
Protein GI: 158315156
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
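
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of this record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3973682, 3975316)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values listed above, this gives 1635 bp and 544 residues, in agreement with the Gene Length and Protein Length fields.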


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACA ACGCTTCCAC ACCTCTGGAA TCCGGCCCGG GGCTGACCCG CCGGAGGATC 
CTCGGCACAG TGGCGCTTGG CGCGGCTGCG GCTGGTGGGC TGGCGGCCGG CCCTGGCAGC
GGACGGGCAC GGGCCGCCAC AGCCCCGGCG GACGCGGCCG TCACCGAGCA GCGGGAGCGG
GCCGTCGTCA TCGGCAGCGG ATTCGGCGGC GGCGTGACCG CGCTGCGGCT GGCACAGGCC
GGAGTGCCCA CTCTTGTCCT GGAGCGTGGC CTGCGGTGGT CGACGGGGCC GAACGCGACG
ACCTTCTGCC GCTGGGCGAA CATCGACAAC CGGTCCGCCT GGCTGACCGA CCACAGCACG
ACCCCGGGCG TGGACAAGAC CTGGACGCCC TACACCGGAG TGATCGAGAG CCTGCCGGGC
GACGGCATAA CGGTCAACTG CGGTGCCGGC GTCGGTGGCG GCTCGCTGGT GTACCACGGG
ATGACACTGC AGCCGACCAG CCAGAACTTC GCGGCATCCA TCCCGGAGGC CGCGAGTCTC
TACCGCGACC TGAACCGGTG GGCCTACCCC ACCGTCGCCA CCATGCTGGG CGTCTCCACG
GTGCCGAACG ACGTGCTGAA CGCGGATCAG TACCGGTCGT CCCGGCTCTT TCTGGACGTG
GCCGCCAACA GCGGCCTGGA GCCGTTCCGG GTTCCGCTGC CGGTCGACTG GAGCTACGTC
CGGGGTGAGC TCACCGGTCA GTACGAGCCG ACCTACACCA CCAGCGACAT CGTCTTCGGC
GTCAACAACG GCGGCAAGCA CTCGATCGAC GTCACCTACC TGGCCTCGGC CGAGGCCACG
GGCCGGGTAC GGGTGAGCCC ACTGCACGTG GTGCGGGACG TCCAGATGGA CCCGGACGGC
CGCTGGGTGC TCTCCGTCGA CCGCATCGAC ACCGGCGGCA CCGTGCAGGA GCGGAAGCGG
ATCACCGCGG ACGCGGTGTT CCTCAACGCC GGTTCCGCCG GTACCACGCG GCTCCTGGTC
AAGGCGCGAG CCAAGGGGCT GGTGCCCGAC CTCCCCGACG CCATCGGCAC GAAGTGGGGA
AACAACGGTG ATCGGATCTA CGCCTGGGTC GGCATGAACG GTGATCCTGG CACCCGGCAG
GGCGGCCCGG CGTGTGTGGG CGGCCGGGAC ACACAGGGGC CGATTCCGGC CACCGTCATC
CATGCGGGTG CGCCCGCCGA CACCGGCGGC GTGAAGCTGA TGACGGTCGT CGGTTTCGGG
ATCGTCGACG CCGCTGGCAC GTGGGCCTAC GACCCGAACA CGGATGACGC CAGGCTCACC
TGGCCGGCGA CCGGTGACGC CGCGCTCCAG ACCCAGATCG CCGCCCGGAT GCAGGCGATC
GTCGCCGCGG GTGGCGGGAT GATGATCGAC ACGAACGCCC AGGCGAACTC GACCTGGCAT
GCCCTGGGCG GGGTGCCGAT GGGATCGGCG GTCGACCTGT ACGGCCGGAT CATCGGCCAC
AAGGGCCTCT ACGTACTCGA CGGGTCGCGG ATCCCGGGCT CCACCGGTGC CTGTAACCCC
TCCATGACGA TCGCGGCCCT GGCCGAGCAC AGCATGTCCA CGATCATCCG TGAGGACGTC
GGACGTGTCT TCTGA
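
The GC content field in the gene information can in principle be checked against the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here, so spaces and line breaks must be ignored. The function name `gc_content` is hypothetical, and how the source database rounds its reported percentage is not stated in this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (block spacing and line breaks) is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative input, not the gene sequence itself.
print(gc_content("ATGC GC\nAT"))
```

Passing the full gene sequence as a single string yields a percentage that can be compared with the listed GC content value.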
 
Protein sequence
MSDNASTPLE SGPGLTRRRI LGTVALGAAA AGGLAAGPGS GRARAATAPA DAAVTEQRER 
AVVIGSGFGG GVTALRLAQA GVPTLVLERG LRWSTGPNAT TFCRWANIDN RSAWLTDHST
TPGVDKTWTP YTGVIESLPG DGITVNCGAG VGGGSLVYHG MTLQPTSQNF AASIPEAASL
YRDLNRWAYP TVATMLGVST VPNDVLNADQ YRSSRLFLDV AANSGLEPFR VPLPVDWSYV
RGELTGQYEP TYTTSDIVFG VNNGGKHSID VTYLASAEAT GRVRVSPLHV VRDVQMDPDG
RWVLSVDRID TGGTVQERKR ITADAVFLNA GSAGTTRLLV KARAKGLVPD LPDAIGTKWG
NNGDRIYAWV GMNGDPGTRQ GGPACVGGRD TQGPIPATVI HAGAPADTGG VKLMTVVGFG
IVDAAGTWAY DPNTDDARLT WPATGDAALQ TQIAARMQAI VAAGGGMMID TNAQANSTWH
ALGGVPMGSA VDLYGRIIGH KGLYVLDGSR IPGSTGACNP SMTIAALAEH SMSTIIREDV
GRVF