Gene Francci3_4012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4012 
Symbol 
ID3906973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4796627 
End bp4797937 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID637881341 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_483091 
Protein GI86742691 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.848484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.567977 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG GACTCCGACA TATTTCCATA GAATCGTTCG GCGAGATCTT CCCGGGAAGA 
ATCTCAACCG TCGGCACAGA ATTCGAGATA CAGTCCGGCA TAACCCTTTC ACCACGGAGA
ACGTCCGGCA GGAAAGATGC ACCCTACCTC CGCGTTGCGA ATGTACAACG CGGTCGTCTC
ACACTAAGCG ATGTCGCATG GCTAGAGGCA TCAGCTCGCG AACGGATTAG GTACGCACTG
GATGACGGAG ATCTACTCGT AGTTGAGGGG CACGCCAACC CAGCCGAGAT CGGAAGATGC
GCCCAGGTGG GGCCAGAGTC GAAGAATTGC CTCTACCAAA ACCATCTGTT TAGATTGCGC
CCAAGGAATC TTGAAGCGAG ATTTGCGCTA CACTGGCTGA ATTCCAGCTT TTCCCAGTCC
TACTGGGGGA GAAACTGCGC CACAAGCTCC GGTTTGTATA CGATTAATTC TCGACAGCTG
GGGGCACTTC CAATTCCGGT CCCACCGCCA GATAAACAAC GTAAGATTTC CGAGATCCTG
GACGCGGCAG ACGAGGCGAT CCGTTCAACG GAGCGACTCG TCGGCAAGCT CGAACAGGTG
TTCGACTCAT TGCGGGGCGA TCTACTTCAG GAGCATGTAA TTCGGTCGGG TCGACTTCCC
GACTGCTGGC GGATGGACCG GCTAGACCGT CTGAGCGAGA TCACGGGAGG CGTAACGCTC
GGCGGTGTTA CATCCGCTGG CCGTTCAGTC GAGCTTCCCT ACCTTCGGGT CGCAAACGTG
CAAGATGGAT ATATCGACAC TACCGACATC AAGACGGTAA CCGTGCGAAC ATCGGAGTTT
GATCGCTACC TGCTTCAAGC TGGAGACGTT CTCATGACGG AGGGAGGGGA CTTCGACAAG
CTCGGGCGTG GTGCCGTCTG GGACGGGTCG ATTGACCCCT GCCTACACCA AAATCATATC
TTCCGTGTTC GCTGCGACAA GATTCGCCTG CTCCCCGAGT ATTTGTCTAC CTACAGCGCA
TCCACTGCAG GGCGCAGCTA CTTCATGGGC ATCTCGAAGC AAACTACCAA CCTGGCATCG
ATCAACAAGA GTCAGCTATC CGCACTCCCC GTTCCACTAC CTCCACTGGC GACACAGAAA
ATGATAATTG GATCACTGGG CGCTGCCGAA CGACAGATAT CCTCGACAAA GGCCGAGCTG
GCGAAGTTGC GACTCGTCAA GCAGGGGCTG ATGGATGATC TGTTGATGGG GCGGGTTCAG
GTGTCGGGGT TGCGGGATGT GTCGGATGCA GTGGATACGC TGGCGGTATG A
 
Protein sequence
MSDGLRHISI ESFGEIFPGR ISTVGTEFEI QSGITLSPRR TSGRKDAPYL RVANVQRGRL 
TLSDVAWLEA SARERIRYAL DDGDLLVVEG HANPAEIGRC AQVGPESKNC LYQNHLFRLR
PRNLEARFAL HWLNSSFSQS YWGRNCATSS GLYTINSRQL GALPIPVPPP DKQRKISEIL
DAADEAIRST ERLVGKLEQV FDSLRGDLLQ EHVIRSGRLP DCWRMDRLDR LSEITGGVTL
GGVTSAGRSV ELPYLRVANV QDGYIDTTDI KTVTVRTSEF DRYLLQAGDV LMTEGGDFDK
LGRGAVWDGS IDPCLHQNHI FRVRCDKIRL LPEYLSTYSA STAGRSYFMG ISKQTTNLAS
INKSQLSALP VPLPPLATQK MIIGSLGAAE RQISSTKAEL AKLRLVKQGL MDDLLMGRVQ
VSGLRDVSDA VDTLAV