Gene Francci3_2884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2884 
Symbol 
ID3906015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3399255 
End bp3400943 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content73% 
IMG OID637880205 
Producttransposase IS66 
Protein accessionYP_481971 
Protein GI86741571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000946302 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.757994 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGGG TTGTGGCTGA GCTGTCCGCG TCGTCGTGGG AGTCGCGTCT CGCGGCTGCC 
GAGGCGCGGA TCGAGGAACT CGCCGTCGGG CAGGCCGCGT CGGTGGCGGC GAACGAGCGG
TTGGTGTCGG TCAACGAGCG GCTGCGTCGG GTCGTGGAGG ACTCGGCCGC GCGGCATGAG
GTGGAGCTTG GCGCGGTGCG CGCCGAGCGG GACCGGGCCG TGCGGCGGGT CGAGGAGCTT
GAGCTCGAGT TCGCCGAGCT GCGTCGGCGG TTGTCGATGG ACAGTACGAA CTCCTCTGTG
CCGCCGTCGA AGGAGCCGGT CGGGGCGAGG GAGAAGCGGA AGGCCGAGCG GTGGGCGTCC
TCGGAGCGGG TCCGGTCGAA GGACAGGAAG CCGGGTGGGC AGCCGGGCCA TCCGGGGGCG
GGGCTGGCGC GGGAGGAGAC GCCGGACCGG TCGTTGTCGG CGGACCCGCC GTCGGAATGC
TCGTCGTGCC AGGCGGATCT GTCCGACGCG GGGGTGCTGG CGGACGGGTG GGCGCAGGTC
TGGGACCTAC TCGAACCGGT GCTGGAGAAG GTCGAGTGGG TGTTGCCTCG GCGGCGCTGC
GCGTGCTGCG CGAAGGTCAC CACCGCCGTG GTTCCGGGGG TGCGGCACGC GGCGGCGGGC
GCGGTGTCCT ACGGGCCCAG GCTGCACGGC GCGGCGGTGC TGCTCGCCTC CGAGGGCAAC
GTGCCGGTCG AGCGGGCCGC GATGGTCATC GGCGCGCTGC TCGGGGTGCC GGTGTCGGCC
GGGTTCGTCG CCCGCGCGAA CGCCCGGCTC GCCCAGGACC TGAAGGCCGC CGGGTTCGAC
GAGGCGATGA AGGCGGCGCT GCGGGCCGAG CCGGTGCTGT GCGGGGACGA GTCGCCGGTC
AACGTGCTGC GCAAGGATCG TGACGAGGCC ACCGGCACCG CCCTGTCGGG CACCCCGCAC
CTGCTGGTGG TGCGCACCCC GGCGCCCGGG CTGGTCTGGT ATGCGGCGAT CAGCTCCCGC
TCGTCCGGGG CGATCGACGC CACCGGCGTC CTCACCGGCT GGCACGGCTA CCTGACCCGC
GACGACTACG CCGGCTGGCA CCAGTACGAC CCCACGCTCG CCGGCGTGCA GTTGTGCTGT
GCCCATCTCA TCAGGGCGTT GCGTGCGGTG CTGATCCTCG CGCCGAAGGT CCAGAAGTGG
GCGGGCCGGC TCATCGACCT GCTCCGCGAG GCCAACGGCC TGGTCGTCGC CGCCCGCGCC
GCCGGGAACA GCCGCCTCGG CCAGGCCGCG ATCGACGCGC TGCGGGCCCG CTACGACGCC
GACGTCCGGA TCGGCGAGCT GACGAACATG TCCCGCCCCT GGGCCGACGA GAAGAACCAT
CCCGGCCTGG TCCTGGCCCG CCGGCTCGCC GCGAAGGCAG ACCAGGTCTG GCTCTTCACC
ACCGACTTCA AGATCCCATG GACGAACAAC CCCAGCGAGC AGGCCATACG CCTCCCGAAA
CGCCACCAGG CCGTCTCGGG CTACTGGCAC ACCCCCACCA TGGCCGCCTA CCTACGCGTG
CGCTCCTACC TCGTATCCGC CCGCGACCAC GGCCTGACCG CCGTCGACGC CATCCGCCTC
GCCCTCGCCG GTACGCCCTG GATGCCAGCC AGGCGCGCGG CCAGCCCCAC ACACGCTCTC
GCCACCTAA
 
Protein sequence
MIRVVAELSA SSWESRLAAA EARIEELAVG QAASVAANER LVSVNERLRR VVEDSAARHE 
VELGAVRAER DRAVRRVEEL ELEFAELRRR LSMDSTNSSV PPSKEPVGAR EKRKAERWAS
SERVRSKDRK PGGQPGHPGA GLAREETPDR SLSADPPSEC SSCQADLSDA GVLADGWAQV
WDLLEPVLEK VEWVLPRRRC ACCAKVTTAV VPGVRHAAAG AVSYGPRLHG AAVLLASEGN
VPVERAAMVI GALLGVPVSA GFVARANARL AQDLKAAGFD EAMKAALRAE PVLCGDESPV
NVLRKDRDEA TGTALSGTPH LLVVRTPAPG LVWYAAISSR SSGAIDATGV LTGWHGYLTR
DDYAGWHQYD PTLAGVQLCC AHLIRALRAV LILAPKVQKW AGRLIDLLRE ANGLVVAARA
AGNSRLGQAA IDALRARYDA DVRIGELTNM SRPWADEKNH PGLVLARRLA AKADQVWLFT
TDFKIPWTNN PSEQAIRLPK RHQAVSGYWH TPTMAAYLRV RSYLVSARDH GLTAVDAIRL
ALAGTPWMPA RRAASPTHAL AT