Gene Franean1_4699 details

Gene Information

Locus tag: Franean1_4699
Symbol:
ID: 5673041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5613718
End bp: 5616051
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 69%
IMG OID: 641243556
Product: carbon-monoxide dehydrogenase (acceptor)
Protein accession: YP_001508972
Protein GI: 158316464
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR02416] carbon-monoxide dehydrogenase, large subunit
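
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither is stated on this page, but both assumptions reproduce the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Length in aa, assuming an unspliced CDS whose length is a multiple of 3."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon, if counted in the gene length, encodes no amino acid.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(5613718, 5616051)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2334 bp) and Protein Length (777 aa) fields.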


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTCG TCGGCGAGCG GGTGGCAAGA GTCGAGGACC CGCGGTTGTT GACGGGCCGT 
GGAAAATTCG TCGACGACGT GACGTTGCCG CGCATGCTCC ATGTCGCTTT CGTACGCAGC
CCCATGGCCC ACGCTCGGAT CAACAGCATC GACACCACGG CGGCGCGAGC GGTGCCCGGC
GTCGTGGCGG TGTTCACCGG CGCCGACCTG GTAGCCGGCT GCAATCCCCT CACGGGATTG
CGCAACGGAC CGAACTGGCC GACGTTCCGC GCCTTGCCCA CGGACAAGGC AAGGTTCGTC
GGCGATCCGC TCGTGATGGT CGTGGCCGAC AGCCGGTACA TAGCCGAGGA CGCGTGCGAG
CTGGTCGACA TCGACTACGA GCCGCTTCCG GCGGTGACCG GTTACGAAGC TGCCCTCGAC
CCCGCATCAC CGGCCATCTT CGACGAACTC GGCGACAACA TCGTGGCGAC GAGCCCACCC
GTCACCCTGG GTGACATCGA TGCCGCCTTC GCCGAGGCCG ACCAGGTGGT GCGGGCGACC
TTGCGCCAAC ACCGTGTCGC CAACGTCCCC CTGGAAACGA GGGGCGGGAT CGCCGACTAC
GACCCGGCCT CGGGGGAGCT GACCTTCATC GCCTCGACCC AGACCCCGCA TGGCCTGCGG
CAGGCACTGG CCCAGGCTCT CGACCATCCC CTGGAACGGC TGCGGGTGCT TGCGGGCGAC
GTGGGTGGTG GCTTCGGCCT CAAGGGTGTG GTGGGGCGCG AGGACTTCTG CATCGCGCTG
GCCAGCAAGC GGCTCGGCCG TCCGGTCAAA TGGGTGGAGG ACCGCAACGA GCACCTTCTC
GCGTCCGGCC ACGCCCGCGA AGAGAAGATC GACGCGGAAC TCGCGCTGAA GGCCGACGGC
ACGCTGCTCG GGTTGAAGGT GAAGCTCGTC CTGGACGGCG GCGCCTACCC GGCGGTGCCG
TTCAGCTGCA CGATCTACCC TGAGATGATC TGTACGTCGC TGCCTGGCCC TTACCACATC
AAAGCCTACA AATATGAAAG CGCTGTCGTC GCCAGCAACA AGGCTACGTA CGTCGCCTAC
CGGGGCCCGT GGGAGATGGA GGTGTGGACG CGGGAACGGC TGCTTGACAT CGCGGCCCAC
GAGCTGGGGC TCGACCCCGC CGACATCCGG CGCCGGAACC TGGTGGCCGG CGAGCCCGGC
GACCAGATCA TCACTGGCCG AGGCCTGGAG GGCATCACAT CACGCCGCTC CCTTGAGCAG
GCCCTGGACC TCGTCGATTA CGACGGGTTC CGGAAGGAGC AGGTAGCCGC GCGTGCCGCG
GGCCGCTACC TCGGCATCGG GTTCGCCATC TTCATGGAAG CGGCGCCCGG GCCGCCGGAG
TTGCGGGGCA AAGGTGCGCC CTTCGGCGGC GAACAGGCGA AGGCGGCGCT CCAGGCCGAC
GGCCATCTCC TCGTGACCAC CGGTCAGGCC CCGCACGGCC AGGGCCACGA GACGACGCTC
GCCCAGGTCG CGGCCGACCA GATGGGTATT CCGCTGGACC ACGTGCGGGT GGTCCACGGC
GACACGCGTC AGACCCCGTT CAACCTCATC GGCACCGGCG GCAGCCGGGC CGGCACGTGG
GCGACGGGTG CCGTGATCGT GACGACCAGG CGGCTGAAGG AAAAGGTGCT GGACATCGCG
GCCCACCGGC TGCAGATCGA TCCTGGCGAC CTCGACATCG TCGACGGGAT GGTGACGCCG
AAGGGGGCGC CGGACAAGGC GATCCCGCTG GCCGACGTGG CGAAGCTGGC GATGATGGCG
CCCATGTTCC TCCCGCCGGG CACCGACGTG TCCTTGACGG CGCAGGAGCG CTTCGACGGG
TCCGCCGTCA CCAGCAGCGG TTGGTCGGGC GGCGCACATG TCTGCACCGT CGAGGTGGAC
ATCGCCACGG GGCAGGTCCG GATACTTCGT TACGTCGTCG TCGAGGACTG CGGACGGGTG
GTCAACCCGG CGATCGTCGA GGGCCAGATC TGCGGCGGTA TCGCGCAGGG CATCGGCGAG
GTGCTGTACG AGAACGCCGC CTACGACGCC GACGGCAACT TCCTCGCCGC CACGTTCATG
GACTACCTGC TCCCGACCGC CGCCGAGATC CCGCACATCG AGATCGAACA CATCGATTCC
GCAGGCCTCG GCGACTTCGA CTTCCACGGT GTCGGCGAAG GCGGGGCGCT GGCCGCCCCG
GCGACGCTGA CCAACGCCGT GGCGGACGCT CTGCTGCCCT TCGGAGCCCG GGTGGTCGAC
CAGTACCTTC CCCCAGCGAA GATCCTCGAG CTCGCGGGCG TGATCTCTGC CTGA
 
Protein sequence
MRFVGERVAR VEDPRLLTGR GKFVDDVTLP RMLHVAFVRS PMAHARINSI DTTAARAVPG 
VVAVFTGADL VAGCNPLTGL RNGPNWPTFR ALPTDKARFV GDPLVMVVAD SRYIAEDACE
LVDIDYEPLP AVTGYEAALD PASPAIFDEL GDNIVATSPP VTLGDIDAAF AEADQVVRAT
LRQHRVANVP LETRGGIADY DPASGELTFI ASTQTPHGLR QALAQALDHP LERLRVLAGD
VGGGFGLKGV VGREDFCIAL ASKRLGRPVK WVEDRNEHLL ASGHAREEKI DAELALKADG
TLLGLKVKLV LDGGAYPAVP FSCTIYPEMI CTSLPGPYHI KAYKYESAVV ASNKATYVAY
RGPWEMEVWT RERLLDIAAH ELGLDPADIR RRNLVAGEPG DQIITGRGLE GITSRRSLEQ
ALDLVDYDGF RKEQVAARAA GRYLGIGFAI FMEAAPGPPE LRGKGAPFGG EQAKAALQAD
GHLLVTTGQA PHGQGHETTL AQVAADQMGI PLDHVRVVHG DTRQTPFNLI GTGGSRAGTW
ATGAVIVTTR RLKEKVLDIA AHRLQIDPGD LDIVDGMVTP KGAPDKAIPL ADVAKLAMMA
PMFLPPGTDV SLTAQERFDG SAVTSSGWSG GAHVCTVEVD IATGQVRILR YVVVEDCGRV
VNPAIVEGQI CGGIAQGIGE VLYENAAYDA DGNFLAATFM DYLLPTAAEI PHIEIEHIDS
AGLGDFDFHG VGEGGALAAP ATLTNAVADA LLPFGARVVD QYLPPAKILE LAGVISA
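
As a consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the corresponding ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 0, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon: end of the coding sequence
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this page.
first_line = "ATGAGATTCG TCGGCGAGCG GGTGGCAAGA GTCGAGGACC CGCGGTTGTT GACGGGCCGT"
last_line = "CAGTACCTTC CCCCAGCGAA GATCCTCGAG CTCGCGGGCG TGATCTCTGC CTGA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

The last line can be translated on its own because the 2280 bases preceding it are a multiple of three, so it starts in frame. For a full-length check, the whole gene sequence can be passed to the same function.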