Gene Francci3_2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2949 
Symbol 
ID3903764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3483997 
End bp3486126 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content68% 
IMG OID637880270 
Productcatalase 
Protein accessionYP_482036 
Protein GI86741636 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0693] Putative intracellular protease/amidase
[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.962169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.093907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC CCGGGACGTC GCGCCTCCCC CGGGACGACG CGAAGCAGCA GCAACTCGAC 
GCCGTCCGAA TCGGCGACGA CGGCGCCCGC ATGACCACCG ACCAGGGGAT CGGTGTCGAA
CACACCGACG ACTCGCTCGC GGCGGGGGAA CGCGGTCCGA CGCTGCTGGA GGACTTCCAC
TTCCGGGAGA AGCTCACTCG GTTCGATCAC GAGCGCATCC CCGAGCGGGT CGTCCACGCC
CGTGGTGCCG GTGCCTACGG GTACTTCGAG GCGTACGAGT CGCTCGCCGA CGTGACCCGG
GCGCACTTCC TGGGCGAGGA CGGCCGGCGT ACCCCGGTGT TCGTACGGTT CTCGACCGTC
GGCGGTTCCC GCGGTTCCGC GGACACGGTC CGGGACGTAC GCGGATTTGC CGTAAAATTT
TATACCGAGG AGGGTAACTT CGATCTCGTC GGTAACAACA TGCCGGTGTT CTTCATTCAG
GACGGCATCA AGTTCCCCGA CTTCGTCCAT GCGGTGAAGC CGGAGCCGCA CAACGAGATC
CCGCAGGCAT CCTCCGCGCA CAACACCCTG TGGGACTTCG TCAGTCTCGT CCCCGAGTCG
ATGCACATGA TGATGTGGCT GATGTCGGAC CGGGCGCTGC CGCGCAGCTA CCGGATGATG
CAGGGCTTCG GCGTGCACAC CTTCCGGTTC GTCGATGCCG CCGGGCAGGG GACCTTCGTC
AAGTTCCACT GGCGGCCGAA GCTCGGCACG CACTCCCTGG TCTGGGACGA GACACAGAAG
ATCGCGGGCA AGGACCCCGA CTTCAACCGG CGGGACCTGT GGGAGTCGAT CGAGAACGGT
ACGTTCCCGG AGTGGGAACT CGGCGTGCAG CTCGTGCCGG AATCCGACGA GCACGCGTTC
GACTTCGATC TTCTCGACGC GACGAAGATC ATCCCTGAGG AGCGGGTGCC GGTGCGGCCG
GTGGGACGGC TGGTGCTCAA CCGCAACCCC GGCAATTTCT TCGCCGAGAC CGAGCAGGTG
GCGTTCTGCG TACAGAACGT CGTCCCCGGC ATCGACTTCA CGAACGACCC GCTGCTGCAG
GCCCGGCTGT TCTCCTACCT GGACACCCAG CTGATCCGGC TCGGCGGCCC GAACTTCGCC
CAGCTGCCGG TCAACCGGCC GGTCGCGGCG GTGCACAACA ACTCCCGCGA CGGTTACGGG
CAGCATCGCA TCCAGCAGAG CCAGACGTCG TACTTCCCGA ACTCGATCAG CGGCGGTTGC
CCGGTGCTCT CCGATCCGGC GCACGGCGGG TACGTGCACT ACGCCGAGCG GGTCGACGGG
AACACGATCC GTAAGCGGAG CGAGAGCTTC AAGGACTTCT ACAGCCAGGC CACCCTGTTC
TGGAACAGCA TGTCCTCCTG GGAGCGGCGG CACATCGTCG ACGCGTTCAG CTTTGAGCTC
GGCAAGGTCG ACTACACCCA GATCAAGGAG CGTGTCCTGG GACATCTCGC CCAGGTGGAC
CACGAACTCG CGGCCGGGGT CGCCGAGAAC CTCGGACTGC CGGTGCCACC GGAGAGCACC
CCGAACCACG GACGATCCTC ACCCGCGCTG AGTCAGGCCG ACCAGCCCAG CGGGGTGGCC
ACCCGCAGGA TCGCGGTGCT CGCGGCGGAC GGCGTGGACG AGGTGGCATT GCGTTCGGCC
ACCGCCGCGC TGCGCGAACA GGGCGCGATC CTGGAGGTTC TCGCGCCGCA CGGAGGCATG
CTCGCCACGG TGTCCGGCGA CGCGTTGCCG GTGGACCGAA CGCTGGTGAC CATGTCCTCG
GTCCTCTACG ACGCGGTGTT CGTCGCCCCG GGCGAGCGCG GTGTCACCGC ACTGACACAC
AACGGCGAGG CCGTGCACTA CGTCGGCGAG GCGTACAAGC ACGCGAAGCC GATCGGCGCG
GTCGGCGCGG GGGTGTCGCT GCTCGAGATC GCATCCTTGC CGGGCGCGCG GGTCGCCGAC
CAGGGTGACG CGGTGGTGTC CGACCGGGGT ATCGTGACGG TCCGCGACCT TGCGGGGCCG
TCCGTGCTCG GCGACTTCGG TTCCGCGTTC GCCACGGCCG TGGCCGCCCA TCGGCACTTC
GACCGCGCGC TGGAGGCCGT CGCCGCGTAG
 
Protein sequence
MTDPGTSRLP RDDAKQQQLD AVRIGDDGAR MTTDQGIGVE HTDDSLAAGE RGPTLLEDFH 
FREKLTRFDH ERIPERVVHA RGAGAYGYFE AYESLADVTR AHFLGEDGRR TPVFVRFSTV
GGSRGSADTV RDVRGFAVKF YTEEGNFDLV GNNMPVFFIQ DGIKFPDFVH AVKPEPHNEI
PQASSAHNTL WDFVSLVPES MHMMMWLMSD RALPRSYRMM QGFGVHTFRF VDAAGQGTFV
KFHWRPKLGT HSLVWDETQK IAGKDPDFNR RDLWESIENG TFPEWELGVQ LVPESDEHAF
DFDLLDATKI IPEERVPVRP VGRLVLNRNP GNFFAETEQV AFCVQNVVPG IDFTNDPLLQ
ARLFSYLDTQ LIRLGGPNFA QLPVNRPVAA VHNNSRDGYG QHRIQQSQTS YFPNSISGGC
PVLSDPAHGG YVHYAERVDG NTIRKRSESF KDFYSQATLF WNSMSSWERR HIVDAFSFEL
GKVDYTQIKE RVLGHLAQVD HELAAGVAEN LGLPVPPEST PNHGRSSPAL SQADQPSGVA
TRRIAVLAAD GVDEVALRSA TAALREQGAI LEVLAPHGGM LATVSGDALP VDRTLVTMSS
VLYDAVFVAP GERGVTALTH NGEAVHYVGE AYKHAKPIGA VGAGVSLLEI ASLPGARVAD
QGDAVVSDRG IVTVRDLAGP SVLGDFGSAF ATAVAAHRHF DRALEAVAA