Gene Francci3_2206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2206 
Symbol 
ID3906345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2580477 
End bp2582174 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content73% 
IMG OID637879538 
Producthypothetical protein 
Protein accessionYP_481304 
Protein GI86740904 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.855212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGATCGC AGATTGCGTC GGGACTGCGG CAGGACAGGC GCCCGACCGG CGGCCGCATC 
GTCCCCTCCG GCATCGTCCG GTCGGGCATC GTCCGGTCGG GCGCCGTGCG ATCCCGTCCC
GTGTCACCCC GTGTCGCGTT GACGGCCGTC GCGCTGGTCG GCCTGATCGG GCTGGTCGGC
ACCGGTTGTG ACGGCTCATC GCCCGCGGCG GGTAACCGGC CGTCGGCCGG TACCGACGCG
TCGGGCGGGT CCGGGAGACC CACGGAGCCG GACACGCCCG CCGGGTTCGA CGTCAAGGCC
GAAAACGCCC GGCCCGGCGC CGACTGCGGG CTGGCCCGGC TCGGCGAGGG CCACGCGGTC
GAGGGCTGGC TGGACCGGGT CGGCGTCGAC CCCGGTCAGC CCGTACGGTT GTTCGCCTCC
ACCACGGCGA GCCGGCTCGC CGTCTCGGTG TTCCGGATCG GCTGGTACGG CGGCCACACC
TGCCGGCTCA TCACCCGGCG GGAGGAGCTG CCCGGCCGGG TCCAGCCGTC GGCGGCGATG
AACGCGACGA CGAACACCGT GTCGGCCGCG TCGTGGGCCC CGACCGTGAC CTTCGACACG
GCGACGTGGC CGCCGGGCGA CTACCTGTTC CGCCTCGACA GCAGCAACGG ATTCCAGTCG
TACGTGCCGT TGACGGTGCG CAGCCCGTCC GCGCAGGGCA GGATCGTGAT CCTCAACTCG
GTCACCACCT GGCAGGCGTA CAACGCCTGG GGAGGCTACA GCCTTTACCA CGGGCTGCGG
GGTTTCGCCG ACCGCGCCCG CGTCGTCTCG TTCGACCGGC CCTACGGCTA CGGCGACGGT
GCCGCGGACT TCACCGGGAA CGAGGCGCCC CTGGTGACGC TGGCAGAGCG GCTCGGCCTG
CCGCTCGCCT ATGCCACCGA CATCGACCTG CATGCCGAGC CGCGGCTGTT CGACGGTGCC
CGCGCGGTGA TCTCGCTGGG GCACGACGAG TACTACTCGC CGCGGATGCG GGCGACGCTG
ACCGCCGCCC GGGACGCCGG GGCGAACATC GCCTTCCTCG GCGCCAACGC CGTATACCGC
CGGATCCGGC TCGCGCCCAC GCCCCATGGG CCCGACCGGC TGGAAACGGG CTACAAGGTC
GCCAACGAGG ACCCCCTGTA CGGGCGGGAC AACAGCCAGA TCACCGCGAA CTGGCCGTCG
CCGCCGAACG CCGACCCGGA GAGCTCGCTC ACCGGCGGCA TGTACCAGTG CAACCCGGTG
CACGCCGATC TGGTGGTGAC CAACCCCGGT CACTGGCTGC TGGCCGGCAC GGGGCTGGCG
GCGGGCAGCC GGATCCCGGG CATGATCGGG TCCGAATATG ATCGCGTCGA TCCCAACCGG
CCGACGCCAC AGATGATCGA AGTGCTGGCC CACTCGCCGG TGGCCTGCCA CGGTCAGGCC
GACTACTCCG ACGTCAGCTA CTACACGGCG CCGTCCGGGG CCGGTGTCTT CGACGCCGGC
ACCAGCGCGT GGGTGTGCGC CCTGCTCGAT GTCTGCGGGC CGGGCGCCCA TGGGGAGCCG
GTGCAGCGGT TCGTCACCCA GGTCACCACC ACCCTGCTGC AGGCCTTCGC CGCGGGGCCG
GCCGGGCGGG TCCACCCCGC GGCCCGGTCC GTCCCGGCCG AGGCCCGCAG CACGCCGGTC
ACCGGCAGCG AACGCTGA
 
Protein sequence
MRSQIASGLR QDRRPTGGRI VPSGIVRSGI VRSGAVRSRP VSPRVALTAV ALVGLIGLVG 
TGCDGSSPAA GNRPSAGTDA SGGSGRPTEP DTPAGFDVKA ENARPGADCG LARLGEGHAV
EGWLDRVGVD PGQPVRLFAS TTASRLAVSV FRIGWYGGHT CRLITRREEL PGRVQPSAAM
NATTNTVSAA SWAPTVTFDT ATWPPGDYLF RLDSSNGFQS YVPLTVRSPS AQGRIVILNS
VTTWQAYNAW GGYSLYHGLR GFADRARVVS FDRPYGYGDG AADFTGNEAP LVTLAERLGL
PLAYATDIDL HAEPRLFDGA RAVISLGHDE YYSPRMRATL TAARDAGANI AFLGANAVYR
RIRLAPTPHG PDRLETGYKV ANEDPLYGRD NSQITANWPS PPNADPESSL TGGMYQCNPV
HADLVVTNPG HWLLAGTGLA AGSRIPGMIG SEYDRVDPNR PTPQMIEVLA HSPVACHGQA
DYSDVSYYTA PSGAGVFDAG TSAWVCALLD VCGPGAHGEP VQRFVTQVTT TLLQAFAAGP
AGRVHPAARS VPAEARSTPV TGSER