Gene Franean1_4426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4426 
Symbol 
ID5672778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5287771 
End bp5289510 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content70% 
IMG OID641243295 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001508711 
Protein GI158316203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTT CCTCTTCACA CCGTTCGCTG CTCGACTTCT ACGCCGACCC CGACGCGGGC 
AGCTGGGATG AGGTCGGGTA CGTCCCCTAT CCGCTCGGGC CCCCGGCCGT CGACACACCG
GTCGTGGCCC GCCTGCGGAC GCGGGCGTTG TCTGACCTGC GTGAACGTTA CGACGCGGTC
GTTGTGGGAA GCGGAGCCGG TGGCGGTGTC GCCGCATTCG TGCTCGCGTC CGCGGGAGCA
TCCGTGCTCG TCGTCGAACG CGGCCAGTGG TACGGCGCGG GCGACCTCGC TACCGACCAT
ATCCGCAACC ACCGGTTCTT CCTCGGGGGC GACCTCGACA CACCCCCCGG GCACCCCCGG
GCAGTCCTCG GCGACGACGG TGAGATCGCG GTCGAGTCCG AGGACGGCCG CTACCACCAC
AACGCAATCA CCGTCGGAGG TGGCACCCGG TTCTTCGGCG CCCAGGCCTG GCGGTTCCGG
CCCGAGGACT TCCGGATGGC CTCCCTGTAC GGCGTCCCAG AAGGATCCGC GCTCGCGGAC
TGGCCGATCA CCTACGGTGA CCTCGAACCG TTCTACGACC AGGTGGAATG GGAACTCGGC
GTGGCCGGCC GCCCGCACCC CGATGACGCG CACCGCAAAC GCGACTACCC GATGCCCCCC
TTCCCGCCGA GCGCCCTGGC TGCCGCGCTC GCGGCCGGCG CCGACAGGCT CGGATGGCCG
ACGGGACCAA CCCCGTTGCT GCTGAACACC CGACCCCGCG CTGGCCGGGC CGCATGTATC
CGCTGCGGGA TGTGTGTCGG CTTCACCTGC CCCGTCGACG CCCGCAACGG CACCCACAGC
ACCGTCCTGC CCCGCGCCGT CGACCTCGGC GCCGACCTGA CTGTTGGCGC GCAGGTCACG
CGAGTATCCG CCGCCGGCGA CGTCGAGATC GCCGCCGGCG GCGCGTCACG CGTCATCCGC
GCCGGAACGG TCGTGCTCGC CGGCGGCGCG GTCGAGACCG CCAGGCTGCT CCAGCTCAGC
GGCCTGGGCA ACGACTGGGT CGGCGACTGC CTCCAAGGAC ACCTCTACGC AGGCGCGCTC
GGCGTGTTCG ACGAGCCGGT CCACGACGGA CTCGGACCAG GCCCGTCCAT CGCGACCAGA
CGCTTCGCCC ACGGAAACGA CAGCGTCGTC GGTGGAGGCC TGCTCGGGGA TGACTTCGTC
AGGCTCCCCG TCATGCACTA CTTCATGACC CGCCTCATGG GACTTTCCCC CGACGTCAGC
CAGGCGGACG TGCGTCGCGC CATGGCGGAG AGTTACCGCC ACACCGGCAT CGTCTCCGGG
CCGGTCCAGG AAGTGACGAT TCGCGGCGCA CGCGTCCGGT TGGCGTCAGG CGTGACCGAC
CACCTCGGCC TCCCTGTGGC CCGACTCGAA GGTTTCCATC ATCCCGAGGA CCTCCGAACC
GTCGCGTTCC TCACCGACCG AGCCGAGGAA TGGCTACACG CATCCGGCGC TCGCCAAACC
TGGCAGATAG GCCCGATGAA GGAAACGACA CTATCCGCGG GCCAGCACCA GGCAGGCACC
GCGCGCATGT CCGACTCACC CCGCCACGGC GCAACCGACC CGTTCGGCAA AGTCTGGGGA
ACAGATCGCG TATACGTCGC CGATGCAAGC CTCCACGTCA CCAATGGCGG CGCAAACCCC
GTCCTCACCA TCATGGCCCT CGCCTGGCGA ACCGCAGCAC ACATCGCCGC CAGCGGATAA
 
Protein sequence
MNRSSSHRSL LDFYADPDAG SWDEVGYVPY PLGPPAVDTP VVARLRTRAL SDLRERYDAV 
VVGSGAGGGV AAFVLASAGA SVLVVERGQW YGAGDLATDH IRNHRFFLGG DLDTPPGHPR
AVLGDDGEIA VESEDGRYHH NAITVGGGTR FFGAQAWRFR PEDFRMASLY GVPEGSALAD
WPITYGDLEP FYDQVEWELG VAGRPHPDDA HRKRDYPMPP FPPSALAAAL AAGADRLGWP
TGPTPLLLNT RPRAGRAACI RCGMCVGFTC PVDARNGTHS TVLPRAVDLG ADLTVGAQVT
RVSAAGDVEI AAGGASRVIR AGTVVLAGGA VETARLLQLS GLGNDWVGDC LQGHLYAGAL
GVFDEPVHDG LGPGPSIATR RFAHGNDSVV GGGLLGDDFV RLPVMHYFMT RLMGLSPDVS
QADVRRAMAE SYRHTGIVSG PVQEVTIRGA RVRLASGVTD HLGLPVARLE GFHHPEDLRT
VAFLTDRAEE WLHASGARQT WQIGPMKETT LSAGQHQAGT ARMSDSPRHG ATDPFGKVWG
TDRVYVADAS LHVTNGGANP VLTIMALAWR TAAHIAASG