Gene Francci3_0311 details

Gene Information

Locus tag: Francci3_0311
Symbol:
ID: 3903343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 358220
End bp: 359449
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 61%
IMG OID: 637877640
Product: hypothetical protein
Protein accession: YP_479427
Protein GI: 86739027
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID: [TIGR02032] geranylgeranyl reductase family
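
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record; everything else is an illustrative assumption.
start_bp = 358220
end_bp = 359449
gene_length_bp = 1230
protein_length_aa = 409


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons print True when the record is internally consistent.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the interval spans 1230 bp, which corresponds to 410 codons, or 409 amino acids once the stop codon is excluded.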


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.172492
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCAGAG GAAGCTACGA CGTGGTGATC GTGGGCGCGG GCCCGGCTGG GAGCGTGGCC 
GCCTTCGCAT TGAAGCGACG GAACCCCCGC CTTCGCGTAC TCCTGACCGA CAAGGCTGTT
TTCCCGCGTG ATAAGGCATG CGGTGACGGC CTCGGCGCAG GGGCAGTTGC GGCCCTGCGC
CGCCTAGGCC TGCTTGGTGT TGTCCATGAC GCCATTTCAC CACTGAGCGT GCGAGTCAGC
GGACCTGACG GAACCGAGGC GACGGCGGTC GGCCCGACAG TCGCCGGCCG CGATCTTTCC
GGATACGTAC TTCCACGTGA AATCCTCGAC GCGCGTCTCG TTGCTGCGGC ACGCGAGGTC
GGAGTCGAAA TGCGAGAAGG AACATCATAC CATTCATCGG AGCTAACCGG CAACAGCCGG
GTTATCACAT TCAAGACCGG TTCGAGATCG AACTCCGTCG AGGCAGCCCT GATGATCGGT
GCCGATGGTG CCTATTCTCG GGTACGACGC GATCTTGGAG TCGGCCGCCA GGATGACCGA
TTCTCCTCGA TCGCCATGCG CTCATATGCC AAGTTCTCGG ATCCGCGCCG CGCCCCGCAG
GAGGTTATGC CGCTCCGACT CGACTTCAGT GACAAGGTAC TTCCTGGCTA CGGATGGGTG
TTTCCTGTCT CCGACGATGT CGTCAATCTC GGTGTGGGTC TACCGGTATC CACTATGCGG
GAGAAATCGT TGAACCCGCA CAATCTGCTC AGCGCCTACG TGGAGGATCT ACGGGCCCGA
GGGTTCATCG TCGAAGAGCC GAACAAGCTA CTCTCACATT ATCTCCCGCA TGCCGGAAAA
CTGGCTCCGA TGGCGCATCC CCGCGCCGCA CTCATCGGAG ACGCCGCCGC GAGCATCAAC
CCTCTCAGTG GCGAAGGTAT TGTCTACGCG ATGGTAGCGG CGGAGATGCT CGCCGCAGCC
CTCGAAGGCT GGAATGGGCT GGGCCACGCA GAGCTCAGCG GTGGCCTTCA GCTCTTTGAG
CAGAAATTTA GACGGCGCTT TCGATTGCAC TTCGCCAGCT GTACAGTGGC AAATATACTC
ATGGGCCACA GGAGATGGGC TAATCTGGTT ATCCGGGCAG CGTCTCGAGA CAGCCACGTT
ATGGACGCAG CATCTCTTAT GCTCTTCGAT GAACGCCGTA TGTACCTGTC CACAGGGGCC
CGAATCCTCA TCCGCGGCAC ACGGAGATGA
 
Protein sequence
MVRGSYDVVI VGAGPAGSVA AFALKRRNPR LRVLLTDKAV FPRDKACGDG LGAGAVAALR 
RLGLLGVVHD AISPLSVRVS GPDGTEATAV GPTVAGRDLS GYVLPREILD ARLVAAAREV
GVEMREGTSY HSSELTGNSR VITFKTGSRS NSVEAALMIG ADGAYSRVRR DLGVGRQDDR
FSSIAMRSYA KFSDPRRAPQ EVMPLRLDFS DKVLPGYGWV FPVSDDVVNL GVGLPVSTMR
EKSLNPHNLL SAYVEDLRAR GFIVEEPNKL LSHYLPHAGK LAPMAHPRAA LIGDAAASIN
PLSGEGIVYA MVAAEMLAAA LEGWNGLGHA ELSGGLQLFE QKFRRRFRLH FASCTVANIL
MGHRRWANLV IRAASRDSHV MDAASLMLFD ERRMYLSTGA RILIRGTRR
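
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not a method used by the source database: `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first line of the gene sequence is pasted in to keep the example short. Running the same functions on the full gene sequence would give a value that can be compared with the GC content field in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "GTGGTCAGAG GAAGCTACGA CGTGGTGATC GTGGGCGCGG GCCCGGCTGG GAGCGTGGCC"
first_60 = clean_sequence(first_line)

print(len(first_60))          # number of bases after joining the blocks
print(first_60[:3])           # start codon as printed in the record
print(gc_fraction(first_60))  # GC fraction of this fragment only
```

Note that the gene begins with GTG while the protein begins with M. Under translation table 11 (the bacterial code named in the record), GTG is an accepted alternative start codon and is translated as methionine at the initiation position.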