Gene Francci3_4514 details

Gene Information

Locus tag: Francci3_4514
Symbol:
ID: 3907491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5388473
End bp: 5389552
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 69%
IMG OID: 637881847
Product: hypothetical protein
Protein accession: YP_483589
Protein GI: 86743189
COG category: [S] Function unknown
COG ID: [COG4301] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03438] probable methyltransferase


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.143939
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCAA GGACTCCGAC AACCACCCGA GCGGATGATG CCGTCCCTGA GGGGACGGGC 
GCGAACGGCA CGGCGGAGAT CCCGTTCCCG GGCCTCTCCG TCGACCGCCA CCTGCACGAG
TCGGATCGCG CCGTCACGCT GGCCCGCGAC GTGCTCGCCG GCCTCACCGC GAGCCCCCGC
GAGCTGCCGC CGAAGTGGTT CTACGACACC ACCGGCGGCA TGCTGTTCGA CAAGATCACC
CGGTTGGCGG AGTACTACCC GACCCGGCGG GAACGGGCCA TCCTGGTCAG TTACGCCGAC
GAGATCGCGG CTGCCTGTCC GGCCGACACC CTCGTCGAGC TGGGATCGGG AACCTCCGAC
AAGACCAGGC TGCTGCTCGA CGCGCTGCGG GCGGCCGGCA ACCTCCGCCG TTTCATCCCC
TTCGATGTCG ACGAGGCGGC CCTGGTGCGG GCAGGTCGCG GAATCCTCAC CACCTACCCG
GGGCTCTCCG TGGCGGCCGT GGTCGGGGAC TTCGAACGGC ATCTGCACCA GTTGCCCCGA
GGTGGCCATC GGTTGCTCGC CTTCCTCGGC GGCACGGTCG GCAATCTGCG CCCTGCGCAG
CGGCAGAACC TCCTCCGTAC GCTGCGCGAG CAGGCCGAGA GCTCCGACGC CCTACTACTG
GGAACGGACC TCGTCAAGGA CGTCGACCGC CTCGTCGCCG CGTACGACGA CGGTGCCGGG
GTCACCGCCG CTTTCAACCG CAATGTGCTG ACGGTGATCA ACCGGGAACT CGGGGGCGAC
TTCGACATCC GCGGATTCGC GCATGTGGCG GTCTGGAACG CCGAGAACTC CTGGATCGAG
ATGCGGCTAC GTTCCGTTCG GGAGCAGCGC GTCTCGATCG CGACCCTCGC CGTTACGGTC
GACTTCCAAC CGGGGGAGGA GATCCTCACC GAGATCAGCG CTAAGTTCAC CCTGGACGGC
ATCGCCTCGG AGCTTGCGCT GGCGGGCTGG GCGGTGGCGC GGCAGTGGAC GGATCCCGAT
GGGGATTTTG CCGTGACGCT CGCGACGCCG TCGTCCGCCC CCGCCGCCGT CCCGGTCTGA
 
Protein sequence
MTPRTPTTTR ADDAVPEGTG ANGTAEIPFP GLSVDRHLHE SDRAVTLARD VLAGLTASPR 
ELPPKWFYDT TGGMLFDKIT RLAEYYPTRR ERAILVSYAD EIAAACPADT LVELGSGTSD
KTRLLLDALR AAGNLRRFIP FDVDEAALVR AGRGILTTYP GLSVAAVVGD FERHLHQLPR
GGHRLLAFLG GTVGNLRPAQ RQNLLRTLRE QAESSDALLL GTDLVKDVDR LVAAYDDGAG
VTAAFNRNVL TVINRELGGD FDIRGFAHVA VWNAENSWIE MRLRSVREQR VSIATLAVTV
DFQPGEEILT EISAKFTLDG IASELALAGW AVARQWTDPD GDFAVTLATP SSAPAAVPV
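
Consistency check (illustrative sketch)

The length fields in this record are related by simple arithmetic: a 1080 bp CDS holds 360 codons, and dropping the terminal stop codon leaves 359 residues, which matches the Protein Length field. The Python sketch below shows one way such a check might be done, along with stripping the ten-residue grouping from a sequence block and computing a GC fraction. The function names are hypothetical and are not part of any database tool; the numeric inputs are copied from the Gene Information fields.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the Gene Information fields of this record.
    start_bp, end_bp = 5388473, 5389552
    gene_length_bp = 1080
    protein_length_aa = 359

    # Coordinates are inclusive, so the span is end - start + 1.
    print(end_bp - start_bp + 1 == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)

    # First two blocks of the gene sequence, used only to show the cleaning step.
    print(clean_sequence("ATGACACCAA GGACTCCGAC"))
```

To check the GC content field, the full Gene sequence block can be passed through clean_sequence and then gc_fraction; the result is a fraction between 0 and 1 rather than a percentage.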