Gene Francci3_4234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4234 
Symbol 
ID3907200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5051912 
End bp5053312 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content72% 
IMG OID637881560 
Producthypothetical protein 
Protein accessionYP_483309 
Protein GI86742909 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGATCAA CCACCCCCGC GCCGGCCGAA CGGCGTCTCG TCGACCGCCG ACTCGGCGTC 
CTCACCCAGA TCGTTCCCTA CCAGCCGAGC CCCGCCATGC CGCGTTGCTG GGTCGGCTGG
AGCGCCCGTG CCGCCGACAC CCGTGCGTTC GCCACCTGGT CGGCCGAGCG GTTTGGCTTC
GGTGCCGCCC TGGGCGATCA CGACCGGGCG CGCCGTGCCG CGGTCGGGGA GGCCGTGGAA
CGGTACTGCG GCAACGCCGT TCCCGACGCC CTGGAGATCG CCTGCTACGA CGATCTCGCC
AGGGCCGGGC GGCCGGCTCT CGACCCGGCG ACCCTGGCGC TCTATTCCGA CCGGCAGTAC
CGCGCCCGCG GGTTCCCCTT CCGGCCCTTC ACCCGCCAGA CACCGGTGGC CTGGGTGCCC
GGCCGCGACC TGTACGCCGG GGGTCCGGTG CTGGTGCCCG CGTCGATGGC CTACCTGAAC
TACTTCCGCG GCGCGCACGC CGACGAGGTG GCCACCCACG CCATGCTGTA CGCCGGCATC
GCCACTGGCG AGAACCGTGA GCACGCCGAG CGTTTTGCGT TGGAAGAGCT CTTCGAACGC
GACGCGAACA CCATCTGGTG GGCCAGCGGC GCCGCCGGCT GGGCCGTCGC CGATGCGGCT
GAGCTCCTCG ACCGGTACGA CATCGCCCAC GGAGAGGGCA CCGGTCGCAC CATCCGGTTG
TTCCAGGTGC CAAGCCAGTT CCCGGTTCCG GTGCTCGCCG CCTTCCTTGA AGAACCGGGA
CGGGGGTTGA TCGCGTACGG CACGGCATGC CGGGCGGATC CGCGGGAAGC GGCGACGAAG
GCGCTCGTCG AAGCCTTCGC CATGCTGGAA CTGACCGCCG AGCTCGCGGA CGGTGACAGC
GCGCACTGGC GAGCCGTCGC CCGCGGCGAG ATACCCCCGC ACACGTACCT GCCCTACCGC
GCCGACCGGC GGTACGCGGA TGACATCCGG CCGGATTTCC GCGACCTCGT CGACCTGCCC
GCGGTCGCCC AGCTCTATCT GGATCCACGA ATGCAGGGCC GGCCCCTCGA CCGGCTCCGC
GACGACACCC GCACCACCCG GCTCGCCGAC ATCCCCCGGG CCGACGGGGA CGCCAACGGG
GGCACAGCCC ATCGACGCTA CCTCGACATG CTCGCGACGC AGGGCCTGTC CGCCGTGTCA
GTGGACGTCA CCACTCCGGA CGTGCGGGCG GCCGGCCTCA CCGTCGTGCG GGTCATCGTC
CCCGGGCTCT ACGGCAACCC GCCCGCGGCC TTCCCGTTCC TCGGCGGCGA GCGGCTCTAC
GACGTACCCG CCCAGCTGGG CCTGGCTGCC GGAAAGATCA CCGAAGACGC CCTTTATCCG
TACCCGATCC CGCACGTCTG A
 
Protein sequence
MRSTTPAPAE RRLVDRRLGV LTQIVPYQPS PAMPRCWVGW SARAADTRAF ATWSAERFGF 
GAALGDHDRA RRAAVGEAVE RYCGNAVPDA LEIACYDDLA RAGRPALDPA TLALYSDRQY
RARGFPFRPF TRQTPVAWVP GRDLYAGGPV LVPASMAYLN YFRGAHADEV ATHAMLYAGI
ATGENREHAE RFALEELFER DANTIWWASG AAGWAVADAA ELLDRYDIAH GEGTGRTIRL
FQVPSQFPVP VLAAFLEEPG RGLIAYGTAC RADPREAATK ALVEAFAMLE LTAELADGDS
AHWRAVARGE IPPHTYLPYR ADRRYADDIR PDFRDLVDLP AVAQLYLDPR MQGRPLDRLR
DDTRTTRLAD IPRADGDANG GTAHRRYLDM LATQGLSAVS VDVTTPDVRA AGLTVVRVIV
PGLYGNPPAA FPFLGGERLY DVPAQLGLAA GKITEDALYP YPIPHV