Gene Francci3_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3404 
Symbol 
ID3905644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4037245 
End bp4038513 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content72% 
IMG OID637880727 
ProductTat-translocated enzyme 
Protein accessionYP_482487 
Protein GI86742087 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.564575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG GGTCCGACGG CGGGCGGATG GGACGCCGCT CCTTCCTCGC CGGAGCCTTC 
GCCGCGGGCG CCGCGGCCGC GGGCAGCGCC ACCACCGTCG CGGCCGGAAC CGGCGCTGCG
GTGCTCACCA CCGCCGCGCC GGCCGCCGCG GCCGCCAGCG GAAGCTCCGA CGAGGTGGTT
CCCTTCCACG GCGTGCATCA GGCCGGGATC CTCACCCCGC AGCCGGCCGC GGCCATGTTC
GCGTCCTTCG ACGTCATCGC CGACGACCGG GCCGCGCTCA CCGACCTGTT CAAGACGATC
ACCGCCCGGG CCCGGTTCCT CACCGCCGGC GGTGCTCCGG CCGACCTGGG CACCGTCGCC
CCACCGTCGG ACAGCGGGGT CCTCGGGCCG ACCGTGCCCC CGGACGCGCT CACCGTCACG
GTCGGGGTCG GCGCCTCCCT GTTCGACGGC CGGTTCGGCC TGGCCGACCG GCGTCCCCGG
CGGCTGGCGC CGATGCGTAC CTTCCCGAAC GACAACCTGA ACCCCGCCGA CTGCCACGGC
GATCTCAGCC TGCAGCTGTG TGCGAACAGC CGGGACACGG TGTTGCACGC GCTGCGCGAC
ATCGCCCGGC ACACCCGCGG CGGCATGCAA CTGCGGTGGC GGATCGACGG CTTCCACAGC
CAGCCCCGGC CAGCCGGCGC GCAGCGAAAC CTGCTCGGTT TCAAGGACGG CATCGTCAAC
CCCGATGTGA CCAGCGCCGC CGACATGGAC CGGCTGGTCT GGGTGGACGG TGCGGGCGAG
CCGCGCTGGA CCACCGGCGG CTCCTACCAG GTCGTCCGCA TCATCCGGAT GCTGGTGGAA
TTCTGGGACC GGGTTTCCCT CAACGAGCAG GAAACGATGA TCGGCCGGCG ACGGGACACC
GGGGCACCGC TGGACGGCAC CGCGGAGACC GACATTCCCA ACTACGCCCG CGACCCCAAG
GGCACGGCCA TCCCGCTCAC CGCGCACATC CGGCTCGCCA ACCCGCGCAC CGCGGCCACC
GACAACTCCC GCATCCTGCG CCGCGGCTTC AACTACGACC GGGGAACAGA CAGCAACGGC
AACCTCGACA TGGGCCTGGT CTTCTGCTGC TACCAGCAGG ACATCATCCG GCAGTTCGAG
GCGACGCAGA CCCGGCTCAT CGACGAGCCG CTCGTCGACT ACATCAGCCC GACCGGTGGC
GGCTACTTCT TCGTGCTCCC CGGTGTCCGG GACGCGAGCG ACCACCTGGG CCGGTCGCTG
CTGGCCTGA
 
Protein sequence
MSPGSDGGRM GRRSFLAGAF AAGAAAAGSA TTVAAGTGAA VLTTAAPAAA AASGSSDEVV 
PFHGVHQAGI LTPQPAAAMF ASFDVIADDR AALTDLFKTI TARARFLTAG GAPADLGTVA
PPSDSGVLGP TVPPDALTVT VGVGASLFDG RFGLADRRPR RLAPMRTFPN DNLNPADCHG
DLSLQLCANS RDTVLHALRD IARHTRGGMQ LRWRIDGFHS QPRPAGAQRN LLGFKDGIVN
PDVTSAADMD RLVWVDGAGE PRWTTGGSYQ VVRIIRMLVE FWDRVSLNEQ ETMIGRRRDT
GAPLDGTAET DIPNYARDPK GTAIPLTAHI RLANPRTAAT DNSRILRRGF NYDRGTDSNG
NLDMGLVFCC YQQDIIRQFE ATQTRLIDEP LVDYISPTGG GYFFVLPGVR DASDHLGRSL
LA