Gene Francci3_0067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0067 
Symbol 
ID3905402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp83126 
End bp84502 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content57% 
IMG OID637877397 
ProductHNH endonuclease 
Protein accessionYP_479190 
Protein GI86738790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGATGG TCCCATCGCG CGCCATAATC GAACACATGT TCGGTAAAGA TCTGGAGACG 
GTTCCCCTCC CTGCCCTCGA GACGGAACTC TGCAGCTGGG CGGGTCGACT GGCTGCCGCC
ACCTGCCGTT GGCTGATTCT GCTCGCTGCC TTCGATCGCC GCAAAGGTTG GTCCGCCAGC
GGCATGCCGA CCTGCGCGCA CTGGTTGTCC TGGCGGTGCG GACTCGGACT TCGCGCCAGC
TACGACTACC TTCGAGTCGC CCGAGCACTA GAACTCCTTC CCCTCATTCG CGAGTCCTTC
TCCAAAGGAG AGATCTCCTA TTCAAAGGTT CGGGCTATAA CACGTGTCGC CGAGCCGGAG
ACGGAAGCAA GGTGGGTTGA GCAGGCCGCA CAGTGCACAG CGCAGAAGCT CGAGAGACTC
GTGTCCTTGC ATGCCAAAAT CAATCATGAT CAAAAAGATG AGAACGGGGG GCGCGACGAA
GATGGCCCGA ACAATACGCG GTGCTCTTGG CGGTGGAACG AAGATGGAAC ATTTTCACTG
TCCGTCCGCC TCGATCCAGC CCGTGGTGCG ATCATTGAGA GCGCTCTCGT CATGGCCATG
TCAAGCCTGC ACGACTCCCG AGACTCTAAC CGGGAAGATT CTTCCACGAC ATCCGATGCC
GAGGGCTCGA TCAATGACAC CTCGATAACA CACGCAGGTC CTGAGATGAA AGCCGATGCC
TTGACGGCTA TGTCCGAATC TTTCCTCTCC ACCGGCGCTC CCACGCTGAT GAGCTCCACA
TCTCACACGA TAAATGTACA CATAGATATT GATACACTTA TCGGTTCAAG CCGAGAGAAC
CATGGATCCC CCCTCCAGCG ACATGAAGGG AATGGACTGA ATACCCGAAG GTGTGATGTG
AAGGACCATA TCCCCGTTCT ACCGAACGTC GTTCGAAGAC TGTCCTGCGA CAGCCTCCTT
CGGACACTTA TTATAGACTC CAAGGGAAAC CCTCTCATGC TGGGTCGCAC CCGCCGAAAC
CCAACCACGA GATTGCGGCT AGCAATTTAT GCACGCGACC GAGGGGTATG CCAGTATCCG
GGCTGCCATC ATACCCGCTG GCTCCAGGTA CATCATATGA AGGAATGGGC ATCCGGAGGC
GGAAATACAG ATCTTGATAA TCTCGTTCTG ATCTGCTCCC TTCATCATCG GACTATTCAT
GAAAGGCGGA TTGTTCTGCA GCGCGGACGC GACGGTTCGA TTGTCGCCCG CCATCGTGAC
GGAACGCTGA TGCAGCAGGC GCCACGGCTG CATCTGGGTC CGGATCTGTT GGAGCTCCTC
AGCGATAACA CCTCAGCCGC GCCAGCTGAG ACCGTCCCGA CAAGACGAGT AGCCTGA
 
Protein sequence
MTMVPSRAII EHMFGKDLET VPLPALETEL CSWAGRLAAA TCRWLILLAA FDRRKGWSAS 
GMPTCAHWLS WRCGLGLRAS YDYLRVARAL ELLPLIRESF SKGEISYSKV RAITRVAEPE
TEARWVEQAA QCTAQKLERL VSLHAKINHD QKDENGGRDE DGPNNTRCSW RWNEDGTFSL
SVRLDPARGA IIESALVMAM SSLHDSRDSN REDSSTTSDA EGSINDTSIT HAGPEMKADA
LTAMSESFLS TGAPTLMSST SHTINVHIDI DTLIGSSREN HGSPLQRHEG NGLNTRRCDV
KDHIPVLPNV VRRLSCDSLL RTLIIDSKGN PLMLGRTRRN PTTRLRLAIY ARDRGVCQYP
GCHHTRWLQV HHMKEWASGG GNTDLDNLVL ICSLHHRTIH ERRIVLQRGR DGSIVARHRD
GTLMQQAPRL HLGPDLLELL SDNTSAAPAE TVPTRRVA