Gene Francci3_1132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1132 
Symbol 
ID3906611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1349005 
End bp1350369 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content61% 
IMG OID637878463 
Productrestriction endonuclease 
Protein accessionYP_480240 
Protein GI86739840 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.574602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGAGC GAGAGGGTGT CGCTGTGAGT CGGGTGGCTT GCGAGCCGCG CCCGGGAGAT 
GGTGTCAACG TCCGCGGTCG TTGCTACAGC GGTAGCTACA CCTGGCGGGA CTCCCGACCG
TCCGATTACA AGATCGGTAC CATCCGTCCG TGCGCGTTCG CGAAAGCCTA TACAGGCTGC
TTGACGACCT GGCCACCTCG GCCTCCATCC GCTGACGTCC GTCGTTGTTC GCTGGCCCGG
CTGTGTAGAT GCCCATACCG CCGCCACTTC TCAGGAGTGG GCTGCGCTCC GCTGTCTCCA
CGAGTGCAGT GGCGTTCACG TTCGCTTGTG CCCGCAACGT CAGTCCCCAA CTCCCGGCCT
GGGAACGGAC GCGGGCCATG CCGAGACGGT CGCCACCCCG CGCGTACAGG GGAAGTCGCG
CGACGTTCTA AGATCGTGTT ATGCGAGTAT CGGGGAGGGG TGCGCCGGGT GACGAGCCCT
GACGTCGGTG TGCCGACATA CGATCAGCTG CTCTGGCCGA CGATCAACGC ACTGCGCGAT
CTTGGAGGGT CCGGCGCGAT CTCGGAGATT AATGAACGAG TCATCGAGCT GGAGAAGTTT
AGCCCAGAGC AGCAAGATCA GCTACATGGA GACGGGCCGC GGACGGAGAT CGAGTATCGG
TTGGCCTGGG CTCGTACCTA TCTGAAGATG ATGGGCTTGA CCGACAATAG CAGTCGTGGC
GTTTGGACGT TGTCGGAACC AGGGCGCTCC GTTGACAGGG CAGATATTCC CAAGATCCAT
GCTGAAGCCA AGCGTGCTTG GGCCGCTGAA CGTGCCGAGA AGGCGCGCAA GCCGGCCGAT
TCAGAACCCA AGGCCACCTA CGAACAGGCG ACGTCGGACG ACGGAGAAAA AGACCAGGAA
ATCGCCTGGA AAGAGGAATT GCTCGAAGCG GTGATGGCGA TGCCGCCCGA CGCATTCGAG
CGCCTTGCTC GCCGGCTCCT CCGGGAGGCC GGGTTTATCA GCGTCAATGT CACCGGCGGC
ACCGGCGATG GTGGCATCGA TGGGCTGGGA ATCTACCGTC TCTCGCTCGT CAGCTTCCCG
GTATTCTTCC AATGTAAGCG GTACAAGGGA AGTGTTTCTC CGAGCGCTGT CCGAGATTTC
CGGGGGGCGA TGGCGGGCAG GGGAGATAAA GGGCTTCTGA TCACGACGGG ATCGTTCACC
TCTGCCGCTC AAAAGGAGGC AACCCGCGAC GGCGCTCCTC CGGTTGATCT GATCGACGGG
GATAGACTCT GTGACCTCCT TCGTGAATAC AACCTTGGAG TCAAGGTGGA GATCGTCCAG
CACGAAAAGG CCATAGTTAA CCGAAGATTC TTTGACGACA TTTGA
 
Protein sequence
MTEREGVAVS RVACEPRPGD GVNVRGRCYS GSYTWRDSRP SDYKIGTIRP CAFAKAYTGC 
LTTWPPRPPS ADVRRCSLAR LCRCPYRRHF SGVGCAPLSP RVQWRSRSLV PATSVPNSRP
GNGRGPCRDG RHPARTGEVA RRSKIVLCEY RGGVRRVTSP DVGVPTYDQL LWPTINALRD
LGGSGAISEI NERVIELEKF SPEQQDQLHG DGPRTEIEYR LAWARTYLKM MGLTDNSSRG
VWTLSEPGRS VDRADIPKIH AEAKRAWAAE RAEKARKPAD SEPKATYEQA TSDDGEKDQE
IAWKEELLEA VMAMPPDAFE RLARRLLREA GFISVNVTGG TGDGGIDGLG IYRLSLVSFP
VFFQCKRYKG SVSPSAVRDF RGAMAGRGDK GLLITTGSFT SAAQKEATRD GAPPVDLIDG
DRLCDLLREY NLGVKVEIVQ HEKAIVNRRF FDDI