Gene Francci3_2841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2841 
Symbol 
ID3904753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3347359 
End bp3348711 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content66% 
IMG OID637880162 
Productzeta toxin 
Protein accessionYP_481928 
Protein GI86741528 
COG category[S] Function unknown 
COG ID[COG4185] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.21799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGGCG CTTCTGAATA CGATACCGTT GGCCCGGTCA CGAGAATGGC TCTCTTCTGC 
CCCGGGAGCG TCCATGGCTG TTCGCCGGTC TCCGGCCAGG TTACCGCGGT GACTCCGGCC
GAGTTCCGTG GCCTCGAAAC CGCCCCGGAA CGACCCGACG CAGGCGGTGG GCCGGTCCCG
GAGCAGTCTC GGGTCACCGA GATCGACAGA AAGCTCGACC TGCTGGACCG CGCCGCCTCC
CCGGGCGACT CACCCGAGCC CCGCCATCAG GAGCGACCGG CCGGCCGGGA TGCTCCCAGC
CGGAACACCT CCAGCATCGA CGCCAAGCTG GACCTTCTCG ACAGGGCGGC GCTCGCCCGT
TCCGGCGGCG GCGCGGCTAC GCCGGCGGAC ACCACCGGCG ACCGGCCGTC CGAGGTCCGC
CCGCCCACCG AGCCCGGGAA CAGCGACCGG GCTCGGACCG AGGCCAAGCT GGCGCTGTTG
GAGGACGCGG CCCGCCGCTA CCGCCCCGAG CCCCCGGACG CGCCCGCCCC CGGTCGGGAG
CGCTGGGCCG TCCGCGAGGC ACCGCGGACT CTGCCGGACG ACCATCCGCT CCTCACCCCG
ACGGACACCA TCAACACCCC CGAACGCGCC GCCCTCCGGG AGAATCTGGT GAAGGAGGTG
ATCGGCGATG CCAAGCCGCC GGAGCAGGGC AGCCCCACCC TCGACCTCAT GGGCGGCGGC
GGAGCCTCCG GCAAGGGCTT CGTGCTGGAG TACCTCAAGG ACGAAGGCCA AGTACCCACC
GAGAACGTAG TCCATCTTGA TCCCGACGAG ATCAAGAAAA TGATCCCCGA GTTCGACGAG
ATCATGGGTG CAGGAGACTC GCGCGCGGCT GAGGTGGTCC ATGAAGAGAG CAGCTCACTC
GCGAAGGGAG TCCTTCAACA GGCCATGGAC CGCCGCCTCA ATATCATCTA CGATAGCACC
CTCGGCAACC CGGAGAAGAC CGCCAAGCTG ATCGATGACG CGCATGCGAA GGGATACGAG
GTTCGCCTAT TCGGGGTGAG TGCCGATCCG GAGCTCGCGG TCACGCGCGC CGCGGACCGC
GCCGCAAAGT CCGGCCGCTA TGTTCCCGTT GACCACCAGC TTGCGGCACA CCGTGGATTC
TCCCAGGGCT TCGAAGGTTA TGCCGAGAAG GCCGATAAAG TACGTCTTTA TGACACCAAC
TCTGAACCCC GACAGATCGC CCGCAAGAGG GCGGGCGAAA TTTTGACAAT TCTCGACCAA
GGATCGTACG ATAAATTTCA AAATAAAATA AACATTAATC CAGAAGCCAT GGGGCCGACA
TCACTGTACA CCGATCGAGG CGAAAACCAA TAA
 
Protein sequence
MPGASEYDTV GPVTRMALFC PGSVHGCSPV SGQVTAVTPA EFRGLETAPE RPDAGGGPVP 
EQSRVTEIDR KLDLLDRAAS PGDSPEPRHQ ERPAGRDAPS RNTSSIDAKL DLLDRAALAR
SGGGAATPAD TTGDRPSEVR PPTEPGNSDR ARTEAKLALL EDAARRYRPE PPDAPAPGRE
RWAVREAPRT LPDDHPLLTP TDTINTPERA ALRENLVKEV IGDAKPPEQG SPTLDLMGGG
GASGKGFVLE YLKDEGQVPT ENVVHLDPDE IKKMIPEFDE IMGAGDSRAA EVVHEESSSL
AKGVLQQAMD RRLNIIYDST LGNPEKTAKL IDDAHAKGYE VRLFGVSADP ELAVTRAADR
AAKSGRYVPV DHQLAAHRGF SQGFEGYAEK ADKVRLYDTN SEPRQIARKR AGEILTILDQ
GSYDKFQNKI NINPEAMGPT SLYTDRGENQ