Gene Francci3_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0119 
Symbol 
ID3903449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp144898 
End bp148125 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content56% 
IMG OID637877452 
Productsignal transduction protein 
Protein accessionYP_479242 
Protein GI86738842 
COG category[T] Signal transduction mechanisms 
COG ID[COG5635] Predicted NTPase (NACHT family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.955952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGGGC AAGGGAACGA GGCGGGCAGC CTGCATCGAG CCGGTGTCGC CGCTTGGCTG 
GTGTCGCATG GAATTAATGG AAATCCTATT CCTCTGGACG ACACGGCGAA TTCTGGAACG
ATTCAGAGCG TCGACTTTGA AACAAATGAT GCTATCGACG ACATACATTG TAAGCTAACG
GACGGCCGTA CACTGCTAGT ACAGGCGAAG CGAGCTTGCG GTGACGATCG ACATTTGAAG
GATACCGTCT ACCAGTGGGC ACGCCAACGG CCCGAAGCCG GTGTAATATT CGGCCTCGCT
GTTGCTGAGC CGAAAGGACC AGTTCGGCAT CTCCGTACGG CCTTAAACCG GCGTAGGAGG
TCGATACCAG GAAACCCAAG CTCTGGAGAG AAAAGCGCGC TGGCAGCACT CGCCAAAGCA
TTCCCTGTAG ACATCTCAAC CTCCCGACGC GAAGAAATCT TGGACTCCGC TTATGTGCTG
GTCGTCGATG CGCATGAGCC GGGGTTGCCC CATTTTCAAG CTGCAGCTTC CTGGTTGGTA
GCTGCAGGTG TTCCACATGA GCGCGGGGCC GAAGCATTCG AGGCATTGAG ATCGCGGCTT
CAGGCCGACG CGGCGAAGGC ACGCGGGAGT GATCTGGACG ATTGGCTCGA AATACTTGCT
AAGGCTGGAT ACCCGCTCGC AGAGGAATAT CGTGGGTCAC GCGGCCAGCG CCGCCAGTTC
GACTTGGGAG TTCTGGCGCG ATATTGCGAC CGCTTTCAAG CCGACCGCGG AATGCTATCT
TTCGCGCTAC TCGCCGATGA CCTACCGCCG ATGAAGGTCG AGGGCCTGGC GAAATCATTC
ACGGTTAGTT GGTCGTCGAC CTTCGAAAAT CGGAAGTATG CTTCGGAACC CTTGCTTGAC
GTCGCTCGGC GCTGGGGCCG GCTAGTGGTT ACCGGTCTGC CTGGTTCTGG AAAGTCGACA
GCCATAAGAC AGCTTGCGGC GGAATGGGCA GCCACGTCGG ACGCGCCCAT ACCTATTTTG
GTGTCGCTAA AGGTCGTAGC CGAGAATCCT CCCGAGCCGG CTTCATTCAC TCCGGCTATG
CTCGTCGATG CCGCGCTTCC TAAGCTCGGA GGAGACGAGA GGAGAGCCTT GCACGCCACT
CTACTGGCTA AAATCGGAGA CGGAGACGCT GCGCTTCTCC TCGATGGCCT TGATGAGTGC
GGGACCGCAA CCAGCACGAT AGCTGATGGA ATAACTCGAC TCCTCGAACA GCTACACCCG
GATACCGACG TAATAGTTAC GGCGCGAGGG AGCGCCCTTC CTGCCGCTTC GAAGACAGGA
CTTCCTACCG TTGAGTTGAC TGAACCGCAA GGTCTCACGT CTCAGCTTGA CAATCTGATC
CGCCATGCGG CAAAAATTCG AGCGAACGGC CAGCAAGAGG CCGAGTGGGC GGCCGCGCGT
ATCTCGTGGC TAGAGTCCGT CCGCCGCGCG GATGGTCATG GTGGCGGTCC GTTGCTGGTT
CGATCAAACA GCCTTTGGCA GGTTCCGCTA CTTGCGACAA TGGTTACACT TCTGGCGACA
TTGCGGCCGA CCGAAAAGAT TCCTACCAAT AGATCACTTT TGTTCATGGC GGCAGTGGAG
GAATCGGTCA CCAAATGGGA ATCACGGCGT GCCATACAGC GCGTGCCATG GAACGTAAGT
GACCCTCGGA TGCTCGTCGA CGGCTTTGCC GTGATCGCGC ATGAGCTCGC CCATCGTAGC
GGTCGCTTGT CGGTTGAAAT CGCTAGAAAA TCGATCAGAG ACCGCGCGGT TCAATATTGG
GGAATGTCGC GGGGGCCAGC AGAGTCGGCC TCTGCAGACA TTGTTAGGTT CTGGGACGAA
GTTATCGGGG CCTTTCCGAG AAATTCAGAT GGAATGCTTC TTTCGCGAGC GCAGTATCTT
GTGGATATTG GCGACGCGAT GTGGGCCGCT CACGTTCTCG AAGACGAAGA GGAAAGAAAG
GCTTGGGTGA ATCGTGCGCT CGAGGATCCG AACCGCCGCG AGGCTTTTAT TCTAGCTGTT
TTGTACGCGC CGTCGATTTT GCATAGTATC GAAACCTTCC ATGGTAGTGG AAATGATCGC
GGGCGAAGGG CGCTTGTTTG GGCCGCTGAC GCCATCAGAG AGGGTGGTAG ATCTCTGATG
AATGTTGTCA CCCCCGCCCT GATGGCAGCA CTTGCAGATG CTGCAGGCAA AGGATTGGCC
CTCCCCGATA TCGGTAGCAA CAGTGCAGGT CAGAGGAGCG GGCGAGACCC CAAGGAGTGG
GAGTACGTCC TACGGCTGGC TGGTCTGCCG GTCGTCCCCC AACTCCGCAA GTTGCGGGAC
GACTTGCTTG CCGGACTGCG GTTGAACAAG GAGCACCGTT TGGTGTCCCA AGCGCTCACT
TCACTTACCG ATGCGGTGGT AGACGGGCAT ACCGAAATAC CAAACTCTTC CTTGGCTGCG
CTTAAAGAAA TGCTGGCCGT TCCGGCGCCA CGGCGAGTAA ATCGTCCACC GCGCAGAAAT
TCGCGTGGAG TTATGAGCTT CACTCTACCG GAACGCCGGC GTCCGATAGG ATATGCCGAC
GTGCTCAAAC TTGCAGCCGA GTACGCTGGA AGTCTCGATT CATCGACCCG AACGGCTATC
TACAGCGAGG CTCGGGAAGC TAGTGTTCGC GACTATTTTG AAATCGTAGC CGTTTTGCAG
GCAAAGGGAT TCTCTGATCC GCAACCATTT TTCAGAGTCC CTGAACTTAT TCGCCTAACC
GAGGAGTTTC CCAGTCATTG GGCGGCTGTC GAATGGCTGT TTCGCCCTAT GGAAAATATC
TCGGAAGGTA CGCAGCTGCG GCGGGTGCAG CGGTGGAGAC CCGCCGAGAT TCTAGAGTTC
ACCGAACTCA TCGGTCTACG TGCATCTGGT CTTGATGATC TGAGGGCCGG CAGGAATGAG
GCCACGACGA CCATAGAATC TCTAGTTAAA TGTGTTTGCT CATCCTATAG AATTTCGGAA
TCTCATATCG CGGCTCAAGC GGCCTACATT GCAAGTATCG CATCGACTGG GGACCAGCAG
ATTGTCCGAA TTATTCTGGG GATTCCCCCT GCTCGCGAAG TGCGTTTCGT AGTGCCAAGT
ACGGAATTTC GGCTCGAGGG CCTTGAGTCT TTGCTTGCTG GGTACGCAAG TACATGCCAG
ATACTGAGCC TTTTCAATCT CTGTTTGTCT GATTCTTTCG GCCGTTGA
 
Protein sequence
MSGQGNEAGS LHRAGVAAWL VSHGINGNPI PLDDTANSGT IQSVDFETND AIDDIHCKLT 
DGRTLLVQAK RACGDDRHLK DTVYQWARQR PEAGVIFGLA VAEPKGPVRH LRTALNRRRR
SIPGNPSSGE KSALAALAKA FPVDISTSRR EEILDSAYVL VVDAHEPGLP HFQAAASWLV
AAGVPHERGA EAFEALRSRL QADAAKARGS DLDDWLEILA KAGYPLAEEY RGSRGQRRQF
DLGVLARYCD RFQADRGMLS FALLADDLPP MKVEGLAKSF TVSWSSTFEN RKYASEPLLD
VARRWGRLVV TGLPGSGKST AIRQLAAEWA ATSDAPIPIL VSLKVVAENP PEPASFTPAM
LVDAALPKLG GDERRALHAT LLAKIGDGDA ALLLDGLDEC GTATSTIADG ITRLLEQLHP
DTDVIVTARG SALPAASKTG LPTVELTEPQ GLTSQLDNLI RHAAKIRANG QQEAEWAAAR
ISWLESVRRA DGHGGGPLLV RSNSLWQVPL LATMVTLLAT LRPTEKIPTN RSLLFMAAVE
ESVTKWESRR AIQRVPWNVS DPRMLVDGFA VIAHELAHRS GRLSVEIARK SIRDRAVQYW
GMSRGPAESA SADIVRFWDE VIGAFPRNSD GMLLSRAQYL VDIGDAMWAA HVLEDEEERK
AWVNRALEDP NRREAFILAV LYAPSILHSI ETFHGSGNDR GRRALVWAAD AIREGGRSLM
NVVTPALMAA LADAAGKGLA LPDIGSNSAG QRSGRDPKEW EYVLRLAGLP VVPQLRKLRD
DLLAGLRLNK EHRLVSQALT SLTDAVVDGH TEIPNSSLAA LKEMLAVPAP RRVNRPPRRN
SRGVMSFTLP ERRRPIGYAD VLKLAAEYAG SLDSSTRTAI YSEAREASVR DYFEIVAVLQ
AKGFSDPQPF FRVPELIRLT EEFPSHWAAV EWLFRPMENI SEGTQLRRVQ RWRPAEILEF
TELIGLRASG LDDLRAGRNE ATTTIESLVK CVCSSYRISE SHIAAQAAYI ASIASTGDQQ
IVRIILGIPP AREVRFVVPS TEFRLEGLES LLAGYASTCQ ILSLFNLCLS DSFGR