Gene Franean1_4351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4351 
Symbol 
ID5672706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5194317 
End bp5197163 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content71% 
IMG OID641243224 
Productcyclic nucleotide-binding protein 
Protein accessionYP_001508641 
Protein GI158316133 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1225] Peroxiredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTCA GCACGCAGCC CCAGCAGCGG CAGGCCGCGC CGGCACCCGC GGCCACGGAG 
ACAGCGACAG TCACGGTCTG GGACCGGCTT GCCGACGCCG CCAACCCGGC CCGCTACCGG
CCGAAGCGGC AGGACGGGCT GATCTGGCGC GAGTTGCGAT CCGCGCGGGG CGAGGAGTAC
GTCATCGTCC AGAACCCCGA CGCCGCGACG TACCGCAAGA TCACGATGGC CGAGTTCTAC
CTGTTCGAGC TGATGGACGG CAGCCGCAGC GTCCAGGACC TGGTCGTCGC CTACATGATG
AAGTACCACC GGTTCGCCCT GCCGCTGGTC CTTCGCCTCG TGCGCAACCT GAAGGTCGGG
CAGATGCTGA CCGACCCGCC CCGGTTCGTG TTCGGGCCGT TGGGCGAACA GCTGCGCCGC
CGGCCAGTGA GCTCCTTCGC CACCGGGTTC GCGAAGTCCT TCGTCCGGCG CGAGTTTCCG
CTGCACGGGC TCGACGGCCT GGTCGGCCGC GCCTACGACC GGGGCGTGTG GGTGCTTTTC
ACCCGGCCGG CAAAGATCGT CATGTTCGCG GTGGCGATGC TCGGCGTACC GCTGCTGGCC
TGGTCGCTGT CGTCGTCGGC GTCGGGGCTG GCCGACCAGT CCCTCGTCGT CACAGTTCCG
ACAATGTACG GCTGCCTGCT GGTCGTCGCC GTGCTGCACG AACTGGCGCA CGCCTTCGCG
GCCAAGTCCT ACGGCCGGGT GGTTCGGCGC GGCGGACTAT CGATCTTCTA CGGCTCGCCT
GGGATGTTCG TCGACACCCA GGACATGTGG ATGGAGCCGC GCGGGCCGCG GATGGTCTCG
GCCTGGGCCG GGCCGTTCTC CGGGTTCGTG CTCGCCGGGC TCTCCGGCAT CGTCCTGGTG
GCCACGCCGG ACGCGCCGTG GGCGGCAGTG GTGGCGATCT TCGGCACCGC TGCCCTGGCC
GTGAACCTGG CCCAGCTCAC GCCGCTCATC CAGCTCGACG GCTACTACAT GCTGATGGAC
TGGCTGGAAC TGCCGAACCT GCGAGCCAGG GCGCTGGGGT TCATCCGGGG GGAGCTCCCC
GGCAAGGTCC GTCGCCGGGA GCGGTTCGAC CGCACCGAAC GCGTTTTGAC GATCTTCGGA
CTGGCCGCCG CGGCGTACAC CGGCTACGTG CTCGTCCTGG CAGCCGGGTT CGCCTGGGTC
CGGGCGAAGA CGATGGTGAG CGACGCGGTC GCCGTGCCGA GCCTTGGCCG GGTCCTTGCC
GCCACGGTGG TTCTGCTCCT GACCCTTCCG CTGCTGTACG TGCTCGGGCA GCGGCTGTGG
CGGGGTGGCT CGTCGCTGGT GGTGGCGAGC CGGCGGCTGC GCCGGCTGGC CGCGGAGCGC
CGCTACCGGG AGCGGGTCGC GGTACTCGCG AACGTACCGC CGGTCGCCGA ACTGGGAAAG
GCGTACGTCC AGTGGATGGC GCGGCGGACC ACCGAGAAGG TGTTCCGGGC CGGCACCACG
GTGGTCGCTG CGGACGAGGT CGCCGAGGTC TTCTGCCTGG TGCTGTCCGG CGAGGCCGAG
GTCGTCGAAT CGGGCGCGAA TGCCGGGACG GGGCCGGGCG CGGGAAGTGC GCTGCGCACG
CTCGGCCCGG GCGACTACTT CCCGCCGCCC GGATTGCCAA GGTCGCCCTT GACCGTGCGG
GCGCTCACCG ACGTGCACGT GTTGCGGCTG GCCGGTGCCG ATTTCTCCGA CCGCCTCGCT
CCGCTGCTGG CCCGACGGGC CGAGAACGAC ACACGGGCCG ACGAACGGGC GGAACTCGAG
GGCTTCGCAC TGTTCGACGG CCTCCGCACC CGCGACAAGG ACACGCTGCT GGCCCACCTG
CGGGCACGGA GCTGTGTGGA CGGCGAGGTC GTCGTCGCCG AAGGCGACCC CGGTTCGGCC
TTCTACCTGG TTCGTAGCGG GGCCGTCGCG CTGGCACAGA CCTGCGTCGA CGGGCCGCCG
CGAACCCTGG GCGCCGGGGA ATTCTTCGGC GAGAGCGCAC TGCTGCACGA CGAGCCCCAC
GCCGCGACGG CCACGTCGGT CGGCGAGACC AGGCTGTGGG AGCTGGACCG GACGACGTTC
GACGACGTGG TGTGCCGCTA TTTCGGGCTG TCCGACGCCG TACGCGAATC CGCCGAAGCG
GGCGAAGCGA CGCGTGCCGC TGAGGCGCTG GCGGGCACGG TGGCCGGCCG CTGGCTTGAG
ATCGAGGTCG GCGACCCGGC ACCCGGGTTC ACCCTCGACA CCGCCAGCGG GCCGTCGCCG
GTGTCGTTGT CGGACTACCG GGGCCAGGTG GTGCTGCTCT GGTTCTCCCG CGGCTACAAC
TGCCCGTTCT GCCGGGAGTA CATGGCCCGG TTGGCGCCGG CGGTCGGCGA TTTCGAGCGG
GCCGGCGTGC AGATCCTGCA GCTGGCGCCC AACCTCGTCG ATTCCGCCCG CGAGTTCTGG
CGCGGCAAGG ACCTGCCGTT CCCGTTCCTG TGCGACCCGG AGAAATCCGC CTACCGGCTG
TGCGGCCTGC AGGACATCGG CGCCGGCGAA GCGCAACGCA ACTCGGTGCG GGGCTTCACG
CGGGCGTTCA CCACCGGCCA GGGCCGTACG ACGATGCACG CGCTCTGGCT TGACGTGGTG
AACCCGTCGA TCGGCGAACG GCTCGGCCAC CACACGATGA CGGCGATGCA GCAGGGCGTC
TTCCTGGTCG GTCCGGATGG CGTGCTGCGC CGCAAATACG TCTTCGGCCC ACTCGACGAA
CCGCCGTCGA ACACCGAACT CCTCGAAGCG GCCGCCGAAC TGGGATCGAT CGAGCTGGCG
ACAACACAAC TGGAGGCGGG CTCGTGA
 
Protein sequence
MTVSTQPQQR QAAPAPAATE TATVTVWDRL ADAANPARYR PKRQDGLIWR ELRSARGEEY 
VIVQNPDAAT YRKITMAEFY LFELMDGSRS VQDLVVAYMM KYHRFALPLV LRLVRNLKVG
QMLTDPPRFV FGPLGEQLRR RPVSSFATGF AKSFVRREFP LHGLDGLVGR AYDRGVWVLF
TRPAKIVMFA VAMLGVPLLA WSLSSSASGL ADQSLVVTVP TMYGCLLVVA VLHELAHAFA
AKSYGRVVRR GGLSIFYGSP GMFVDTQDMW MEPRGPRMVS AWAGPFSGFV LAGLSGIVLV
ATPDAPWAAV VAIFGTAALA VNLAQLTPLI QLDGYYMLMD WLELPNLRAR ALGFIRGELP
GKVRRRERFD RTERVLTIFG LAAAAYTGYV LVLAAGFAWV RAKTMVSDAV AVPSLGRVLA
ATVVLLLTLP LLYVLGQRLW RGGSSLVVAS RRLRRLAAER RYRERVAVLA NVPPVAELGK
AYVQWMARRT TEKVFRAGTT VVAADEVAEV FCLVLSGEAE VVESGANAGT GPGAGSALRT
LGPGDYFPPP GLPRSPLTVR ALTDVHVLRL AGADFSDRLA PLLARRAEND TRADERAELE
GFALFDGLRT RDKDTLLAHL RARSCVDGEV VVAEGDPGSA FYLVRSGAVA LAQTCVDGPP
RTLGAGEFFG ESALLHDEPH AATATSVGET RLWELDRTTF DDVVCRYFGL SDAVRESAEA
GEATRAAEAL AGTVAGRWLE IEVGDPAPGF TLDTASGPSP VSLSDYRGQV VLLWFSRGYN
CPFCREYMAR LAPAVGDFER AGVQILQLAP NLVDSAREFW RGKDLPFPFL CDPEKSAYRL
CGLQDIGAGE AQRNSVRGFT RAFTTGQGRT TMHALWLDVV NPSIGERLGH HTMTAMQQGV
FLVGPDGVLR RKYVFGPLDE PPSNTELLEA AAELGSIELA TTQLEAGS