Gene Francci3_1081 details


Gene Information

Locus tag: Francci3_1081
Symbol:
ID: 3906424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1286517
End bp: 1287821
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 77%
IMG OID: 637878415
Product: hypothetical protein
Protein accession: YP_480192
Protein GI: 86739792
COG category:
COG ID:
TIGRFAM ID: [TIGR02679] conserved hypothetical protein TIGR02679
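
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence in this record ends in TAA). The function name lengths_from_coordinates is hypothetical and is not part of any database API.

```python
from typing import Tuple


def lengths_from_coordinates(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed in this record
gene_len, protein_len = lengths_from_coordinates(1286517, 1287821)
print(gene_len, protein_len)  # 1305 434
```

Under these assumptions the result agrees with the Gene Length (1305 bp) and Protein Length (434 aa) fields listed in the record.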


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00809426
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCGT CGTCGACCGG GGCCTCCCGC GGTACGGAAG CCGCCCGCGA TGTGGAACGG 
CTGCGTCGTC TGCTGGGCGG TGAGCACACG GCGTGGCTGC TCGACCGGGT GCGGCGCAGG
ATCGAGCTGG GCCGGCCGCT CACCGGGACG GTCACCCTCG GCCGGGCCAG CGCGGACCAG
CGGCGGGGGG TCGAGCGGCT GCTGGGCCGC CGGGCCGGCA CCGGCGCCTC GCTGTCCGTG
TCGTTGGACG AGGTCGACGC GGTGCTGCGG TCCAGCGGCG CCGCGCCGGA CGGACTCGCC
GCCGCCGTCC GCCAGCTCAC CGGGGATGTC ACGGAGCGGG CGGAGCTGGC CGCGGCGCAG
TCACGAGCCT GGGCGCAGGC GCACCAGCCC CTGGACGACC TCCTCGCCCG GCGCCCCGAA
CTCACCCCCT GGCGGTCCTG GCTGGACAGC ACCGGCCTGC TGCGCCGGCT GGCTGGCACC
CCGGACGCGG CGGGGCCGCT GGCCGCCGAC CTGCTCCGGG TGCTCGACGC GTTGCCCTCA
CCCGGCGTGG CGCTGGGCCG GCTGGCCGCC ACCAGCACCG GTGACGCGCA CGCGCTCGAC
GACGGCCGGC CGCTGGCGAC GCTGGCCCTG GCGGCGGCCC GGGTACTGGG CGGATCCCAC
CCGGCGGGTG ACGGCTCGGC CGCGGGCCGT CGCGCCGCGT GGGCCGCTGT CGGGGTGCAC
CGTGACGAAC TGTCGTCCAC GGTGCTCTGC CTGGGTCTGC CGGGCGGGGC GGACAGCCCG
ACCGGCCGGA TCCTGGCCAT CTGTCGCGAG GCGGGGGAGC CCTGCGTGCT CACTCTGCGC
CAGGTCGGCC CGGGCCAGGA CCGCGGCGAC CTCGGGGTGG GCCTCGGAGT GGTGTACCTG
TGCGAGAATC CGATCGTGCT GGCCAGCGCC GCCGACGAGC TGGCGGGCCG CACCCCGCCG
CTGGTGTGCG TGAACGGCCA GCCGTCGGCC GCCGTCATCA GCCTGCTCGC GGCGCTCGCC
GGCCAGGGCG CGCGGTTTGC CTACCATGGC GACTTCGACT GGGGCGGCAT CCGCATCGCC
AACGGCCTGC GGGAACGGAT CGGCTGGCGG CCATGGCGTT TCGACGCCCG GTCCTATCAG
GCGGCGCTGA CCACGGTGAC CGGCGGCGAG CTGGTTGGCC GGCCGGTCGA CGCGGCCTGG
GACCCCGACC TGCGCCCCGC GCTGGAGCAC CATGCGATCA GAGTGGACGA GGAACTCGTC
CTCGCCGAGC TGATCAACGA CCTCGGCGCC CCACACGCCT GGTAA
 
Protein sequence
MSASSTGASR GTEAARDVER LRRLLGGEHT AWLLDRVRRR IELGRPLTGT VTLGRASADQ 
RRGVERLLGR RAGTGASLSV SLDEVDAVLR SSGAAPDGLA AAVRQLTGDV TERAELAAAQ
SRAWAQAHQP LDDLLARRPE LTPWRSWLDS TGLLRRLAGT PDAAGPLAAD LLRVLDALPS
PGVALGRLAA TSTGDAHALD DGRPLATLAL AAARVLGGSH PAGDGSAAGR RAAWAAVGVH
RDELSSTVLC LGLPGGADSP TGRILAICRE AGEPCVLTLR QVGPGQDRGD LGVGLGVVYL
CENPIVLASA ADELAGRTPP LVCVNGQPSA AVISLLAALA GQGARFAYHG DFDWGGIRIA
NGLRERIGWR PWRFDARSYQ AALTTVTGGE LVGRPVDAAW DPDLRPALEH HAIRVDEELV
LAELINDLGA PHAW