Gene Francci3_2076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2076 
Symbol 
ID3904649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2441754 
End bp2442944 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID637879412 
Productluciferase-like 
Protein accessionYP_481178 
Protein GI86740778 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.621068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCT CTATCTTCCT GAATCCGCAG GTTCCCGGAT CGGGCTACCT TTCTGAACAG 
CACGTCGGGG CGAAGCGACC AATCGGGCGG GACACCGACT CCTACCAGGC AATGCTCCAC
GAGCTGCGCA GCATCGCGGT ACAGGCCGAC CAGACCGGCT TCGACGCTCT GATGCTCAGC
GAGCACCATT TTCATTCTGA GGGCCTAGAG CTCTCGGTCA ACCCGCTGAT GGTGCTGGCC
GACCTGGCCG CGCGAACCGA ACGCCTGCTG CTGGCTCCGC TGGGCATGGT GCTTCCGGCG
TGGGATCCCG TCCGCGCAGC GGAGGACGTC GCCCTTCTCG ACCAGTTCTG TCGCGGCCGG
CTCCGGCTCG GACTGGCCCG GGGTTTCCAG AACCGATGGG TCAACGTCCT CGGCCAACGC
TGGGAGGTGA CGGACGCCAG CGGCGACGGC TCACACACCG ACGACCGCAA CTTCGAGGTG
TTCGGCGAAA TCCTGAAGAT CATGAAGATG TGCTGGACTC AGGAGACCGT GCGATACCGG
AGCGAGATTC TGGACGGCTA TTCGATCCCG CAGCCCTTTG ACGGCATCGC GGACTGGCCG
GCCGTGGAAT GGACCCGCGC GTACGGCGCT GCGGACGAGA TCGACGAGCA GGGCCGCGTC
CGCGCGGTCT CGGTCTGCCC GCGGCCATAC CAGGACCCGT ACCCGGAGCT GTGGCAGCCG
TTCACGATGA GTGACCGGTC GATCGTCCGG TGCGCGCAGG AGGACATCGT CCCGTGGATC
TTCACGCCCT TCCTGAACGA GCATCGGGCG AAGGCTGAGC TTTACCAGTC CGAGTGCGCC
AAGTTCGGTC GGGAGTACAA GCTCGGCGAG CACACCGGCT TCCTGAAGAT GATCGGGCTC
GCCGACACGA CCGAAGAGGC CGCCCGGGTC TTCAATCCCT CGATCCTGAA CGACTTCGCG
ACCTTCTATG GATCGTTTGG CTTCCAGGAC CCGAGCCTGA TTCCGGAGGT CGGCCTGTTC
GGTGCCCCGG ATGACGTCAA GCGAGGCCTG GAAAAGATCT TCAACAGCAC TCCGGACCTG
GAGTGGATCG GGCTCTTCAT GATCGGCCAG CAGGGCGGCC TGCCGCTCGA CCGGGTACTG
CGCAGCCTGG AGCTCTTCGC CGAGGAGATC ATTCCCGAGT TCCGCGACTG A
 
Protein sequence
MKFSIFLNPQ VPGSGYLSEQ HVGAKRPIGR DTDSYQAMLH ELRSIAVQAD QTGFDALMLS 
EHHFHSEGLE LSVNPLMVLA DLAARTERLL LAPLGMVLPA WDPVRAAEDV ALLDQFCRGR
LRLGLARGFQ NRWVNVLGQR WEVTDASGDG SHTDDRNFEV FGEILKIMKM CWTQETVRYR
SEILDGYSIP QPFDGIADWP AVEWTRAYGA ADEIDEQGRV RAVSVCPRPY QDPYPELWQP
FTMSDRSIVR CAQEDIVPWI FTPFLNEHRA KAELYQSECA KFGREYKLGE HTGFLKMIGL
ADTTEEAARV FNPSILNDFA TFYGSFGFQD PSLIPEVGLF GAPDDVKRGL EKIFNSTPDL
EWIGLFMIGQ QGGLPLDRVL RSLELFAEEI IPEFRD