Gene Franean1_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3211 
Symbol 
ID5671587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3788847 
End bp3791216 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content70% 
IMG OID641242105 
Productcarbon-monoxide dehydrogenase (acceptor) 
Protein accessionYP_001507525 
Protein GI158315017 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAACT CCACTGTCAC GACAGCCGCC GCCACGCCGA CCGCCGCGCG GTACGCGGGC 
ACCCGCGTGA ACCGGGTCGA GGACCCGCGG CTGCTTACCG GCCAGGGCAC CTATGTCGAC
GATGTCAAGC GGCCGGGGAT GCTGCACGCG TGCTTTGTAC GCAGCCCGTT CGCCCGCGCG
CGCATCATCG GCATCGACGC GTCGTCGGCG TTGGAGCTCG AGGGCGTGCA CGCCGTGTTC
GTGGCCGAGG ATCTCAACCC GGGCGTCCAC GAGCACTGGT TCGGGATCAT CGGTCGCGAC
GTCCCGGACA CCCCGCGGCC GGCGCTGGCC CAGGACGAGG TCCGCTTCGT CGGCGACCCG
GTGGCCCTCG TCATCGCGGA GGACCGCTAC ATCGCGGAGG ACGCGGTTGA GCTGGTGGAG
GTGGATTACG ACCCGCTGCC TCCGGTCGCC TCCTACATCA GCGCCCAGGA GTCGACCGAG
CTCGTGCACG AGGCCTACCA GAACAACCGG GCCGGACTAC TCAAAGGCGG CGACCAGGGG
CGGGTCGACG AGGCGTGCGC CGGCGCGGCG TTCGTCGTCG AGGAGACGAT CTACCAGCAG
GGGTACGCGC CCGTCCCGAT GGAGACCCGC GGGATCGTCG TGGAGTGGTC GGGTGGCGAG
CTCACGATCT GGGCGGCGTC CCAGGCCCCC CATGAGCTGC GGGCCTTCGC CGCGCGGTAC
CTCGGCCTGC CGGAGAACCG GGTTCGCGTG ATCATGCGGG ACGCGGGCGG GGGCTTCGGG
CAGAAGGTCA ACCCGATGCG CGAGGACATG TGCATGCTGC TCGCCGCCCG GAAGGTTCCC
GCGGCGCTGA AGTGGATCGA GGACCGGCGC GAGAACCTCA TGGCCGCGAA CTCGGCCCGC
CACGAGCACG GTGCCGCCCG TCTCGCGTTC GACGCGGAGG GCAGGATCGT CGCGGCGCAG
ATCGACCATG TCCAGGACGT GGGTTCCTAC CCCACACCGT GGCCGGTGGG TACCGCGGTG
GCGGTCTGCA TGATGTTCCC CGGGCCGTAC CGCATCCCGG TCGCGGCGTG GTCGTCGGCG
TCGGTGTTCT CGAACACGCT GGGTCGTGGC GCCTACCGGG GGCCGTGGCA GTACGAGTCG
GTCGCTCGCG AGGTACTGCT GGACGTCGCC GCGCGCCGGA TGGGCCTCGA CCCGGTCGAG
CTGCGCCGGC GCAACTTCCT GGCCCGCGCG GACCTGCCGT ACGTGAACCC GTGCGGCATG
CCGTACGACC ACATGTCCCC GCGTGAGGTG TTCGAGAAGG CACTGGAGCA CTTCGACTAC
GACGCGTTCC GCCGTGAGCA GGCCGAGGCC AGGGCGAACG GCCGGTACAT CGGTATCGGG
ACGTGCAGCT ACGTGGAGCC CACCACGACC GGGATGTCCT TCTACGCGAC CGAGGGCGCC
ACGATCCGCA TCGAGCCGAG TGGCACGGTC AACGTGTACC TCGCTGGCGG ATCGACGGGC
AACAGCCTGG AGACCACCGC CGTACAGATC GCGGCCGACG CGCTGGGTGT CGACATCAGG
GACGTCAACA CGATCCAGGG AGACACCGCC GTCACGCCCT TCGGCGGGGG CACCGGCGGC
AGTCGCAGCG GTTCGATGAT CGCCGGGGCG GTCGGAGTCA CCGCGGGCGA ACTACGCGAG
CGCATCATCG CCATCGCGGC GCACCGTCTG GAGGCGGCGG CCGAGGACAT CGAGCTCGCG
GACGGGCGGG CCAATGTGCG TGGTACCCCC TCGATCGGCA TGTCGCTCGC CGAGATCGCG
AATGTCGCGT ATTTCGATCC CGCCGGGCTG CCGCCCGGTG TGCAGCCCGG ATTGGAAGTC
AGCGGCCGGT ATCAGGCGCA GGCGCCGATG CTGTGGGCCA ACGCCACGCA CATCTGTACC
TGCGAGGTGG ACACGGAGAC CGGGGTCGTG ACCTTCCTTC GGTACCTCGT CAGCGAGGAC
TGCGGCCCGC GGATCAACCC GAGCATCGTC GAGGGCCAGG TCGACGGTGG AACCGTGCAG
GGCATCGGCG GGGCTCTCTA CGAGGACCTG GCCTATGACG AGGACGGCAA CCCGGTCGCG
ACGACGTTCA TCGACTACCT GTTGCCGACC ATCGCCGAGA TGCCGCTCAT CGAGCACGTG
CACCTCGAGA CCCCGGGACC GGGGCCGGGC GGCTACAAAG GCGCCGGTGA GGGCGGCGCG
ATCGGCGCGC CCCCGGCAGT CATCAACGCG GTCGCCGACG CGCTCGCGCC GTTCGGGGTC
TCCATCACCC ACCTTCCGCT CACGCCCGCG ACGATTGTCG CGCTTCTCGA CGAGGGGAAG
TCCGGCCAGG AGGACCAGCA GCCTCACTGA
 
Protein sequence
MGNSTVTTAA ATPTAARYAG TRVNRVEDPR LLTGQGTYVD DVKRPGMLHA CFVRSPFARA 
RIIGIDASSA LELEGVHAVF VAEDLNPGVH EHWFGIIGRD VPDTPRPALA QDEVRFVGDP
VALVIAEDRY IAEDAVELVE VDYDPLPPVA SYISAQESTE LVHEAYQNNR AGLLKGGDQG
RVDEACAGAA FVVEETIYQQ GYAPVPMETR GIVVEWSGGE LTIWAASQAP HELRAFAARY
LGLPENRVRV IMRDAGGGFG QKVNPMREDM CMLLAARKVP AALKWIEDRR ENLMAANSAR
HEHGAARLAF DAEGRIVAAQ IDHVQDVGSY PTPWPVGTAV AVCMMFPGPY RIPVAAWSSA
SVFSNTLGRG AYRGPWQYES VAREVLLDVA ARRMGLDPVE LRRRNFLARA DLPYVNPCGM
PYDHMSPREV FEKALEHFDY DAFRREQAEA RANGRYIGIG TCSYVEPTTT GMSFYATEGA
TIRIEPSGTV NVYLAGGSTG NSLETTAVQI AADALGVDIR DVNTIQGDTA VTPFGGGTGG
SRSGSMIAGA VGVTAGELRE RIIAIAAHRL EAAAEDIELA DGRANVRGTP SIGMSLAEIA
NVAYFDPAGL PPGVQPGLEV SGRYQAQAPM LWANATHICT CEVDTETGVV TFLRYLVSED
CGPRINPSIV EGQVDGGTVQ GIGGALYEDL AYDEDGNPVA TTFIDYLLPT IAEMPLIEHV
HLETPGPGPG GYKGAGEGGA IGAPPAVINA VADALAPFGV SITHLPLTPA TIVALLDEGK
SGQEDQQPH