Gene Franean1_5035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5035 
Symbol 
ID5675748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6033731 
End bp6036532 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content72% 
IMG OID641243886 
Producthypothetical protein 
Protein accessionYP_001509301 
Protein GI158316793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.103553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGCGC GCCTCGTCGC TGTTGCACCG CCGCTGCTCG CGCGCTGGCG GGAGGCCCGG 
CAGCGCCGGC TGTGGGCGGC GGCGGACCAT CCAGCCCCCC CGGGGGCCAC CACGCCACTG
GTGATCGATC CCGACCTGCT GGTCGCGGCC GACCTGCGTA ACCGGGTGGC CGGTGATCCG
GCGTTCGACC TGTGGACCGC CCGCCGGCAG TGGGTCGACG CCACCTTCGC AGCCGTCCGC
GCCAGCCGGG CGGCACTCGG CGGGGCTGCC CCGGCCGACG TATTCGACAG GGTCGTCGCC
GACGTCGTGG GCGCACCCCT CAGCGAGCTG ACGGAGCTCG ACCAGCGGCG TCGGCGCGCG
AGCTCGATTG ATGCCGAACT GGACGCGCTC CATCTGCTCC CGGCGGAGCT GGCGCGGCTG
GTGCGGCTGC GTGAGATGGC CGCCACCGGC ATCGTCACCG ACGACGAGTG GCGGGAACTC
GACCACCTGC TCACGCAGGT GCGTAAGCGC CAGCGGAGGG CGGACTGGCT CGCAGCAGAG
GCCGCGATCG CGCTGACGCC GGAGCACTTC GTGCTGTCGA CCGGGCCGTG GTCGCCGGTT
GCGTGGCGCG CCAGCGCCGT CGAGCGGGCG GATTGGCTGG ACCGCCTGCA GGCGCGGATG
GACCAGGAGA GCGCGGTCCG CGAAGCCCTG CGGACGGCGG TCAGCGAGGT CGAGCAGGCG
GCGCTGCCCC GGTGCCGGGA CGGCCTGATC TCGCTGCTCG GCACGACCGG ACCGGCCGAG
GAGGCTGACC GGCTGACCGA ACTGCTGCTG GTCGACTTCA GCGGCGGCGG CACGGAACTG
ACGACCCGTG TGGACCAGGC AATCGAGACG GTGCAGTTGC TGTTCCTCGG CCTGCGGTCC
GGCCGACTGC CGTCGGCGCA CCCGGCGCTG GCCTGGACCA TCGAGCCGGG CGGCGCGGCC
AGTCTCGACG AGGAATGGGC GTGGATGGGG GGCTACGACT CCTGGCGCGC GGCCCTGTTC
CTGTTCACCT ATCCCCAGGA CCTGCTCGCC CCCTCGGTCC GCGACACCAG CGCCGCGGTC
CCCGAGACGG CCCGCCCCAC GCCCCCCTTC CGAACGCTGG TCACCGCGCT GCGCGGCCAG
CAGCGGCCAC GTCGCGACAT CGTGCTGACC GCGGCAAACG AGTTCCTGGC GACGATCCGG
CCGACGCTGG TGAACTTCCC GATCCGGCCC CTGCCCTACA CCGCCGGCGA CGAGGGCCTG
CGCGACGGCC TGCTGGCGTT CGGGTACACC GACCGTCCGA CCGCGGCTGA CCTGGCCCGC
ATCCGCGACC TGTCCGCGCG TGTGCTGATC ACCCTTGCGG TCGGACACCC GGGACTTCCG
AACTACCTGC AGGAGCTCTT CTACTATGTG CCGCTGCTGG TGGCGCAGAA CCTGCAACGT
GGCGGCGACT ACGTGGCCGC GCTCGACTGG TACCGCACGA TCTACGCCTT CGACCAGACT
GACCTGCCCG ACTTCTTCGT ACCCTCGGAC GAACGCAAGA TCTACTACGG GCTTCAGCAG
GAAGGGAACG CGCCGGACGC CCTGTCGCGT GGCATCCACT GGCTGCGCGA CGAGGTCAAT
CCGCACGCCC TGGCCGGTCA GTGGTCCAAT CCCTACACGC GGTACACCTT CACCTCGATC
GCGCGCTGCT TCCTGGAATA CGGGGACTTC GAGTTCACCC GGGACACCGG CGAGTCCCGG
GCCAGGGCCC GGTCGTTGTA CCTGACGGCG CGGTCGCTGC TGCGCGCGCC GGAGCTGGCG
AACCTCGCGG GCCCGGGCCT GGGCGTGGCG GTCCCGAGCC CCGTGCTGAC CGCGTTACTC
ACCCGGGTTG ACAACCAGCT CCGCAAGCTT CGGCAGGGCC GTAACATCGC CGGCCTGCGC
CGTCCGGTCG AGCCTCCGGA CGCGGCCGCG TCGACGTCGG CCGGCATGCC CTATATCGGC
GCTGGCGGTC AGCTGGTCAT CCCCGCGCTG AACCCACCGC GACCCACGCC GTACCACTTC
ACGGTGTTGC TGGAGCGAGC CCGGCAGCTT GCCGCCACCG CCGCCCAGGT GGAGGCGTCC
TACCTGAGCG CGCTGGAGAA GCGCGACGCC GAGGCCTACG GCCGGTTGAC CGCCGGCCTG
GATCTGGACG TCGCTCGGGC CGGCGAGGCC CTCCAGTCGT TGCGGGTCAG CGAGGCCCAG
AAGGGCACCG AGCTTGCCCG CCGGCAGAAG GCCGCCAGCG ACGTACGCGC GACCACGTTC
CAGCAGTGGA TCGACGACGG CCCGAACGAA TGGGAGCGCA GCCTCGTCCG GGACTACGAC
GAGGCCCGGG TCTACCGTGA CTGGATGGTG GGACTGGACG CGGCCATCAC GGCGGCGCAG
GCTGTCGCCT CCGTCTCCTC CATCCCGGGT GCGGCCGCCG CGTCGACGGT TGGCGGGCTT
GCCGTGGGCC GGGCCGTGAA CGCGACGAGC CTCAACCGCA CCGAGCAGGA CATCGCGCTG
AACACGCTGC GCGCCAGCCA GGAACGCCGC CAGGACGAGT GGGAGCTCCA GCTCGCGACG
GCCACCCAGG ACGGCCTGGT CGCGCAGGAG GCGATCCTGC TCGCGCTCGA CCACGAGGCC
ATCGTCGGAC AGGAGGCCCG GATCGCCGGC CTCCAGGCGT TCGAGGCCCA GTCGGTGACC
GACTTCCTGA CCCGCAAGTT CACCAGTGCC GAGCTCTACG AATGGATGAG CGGCGTCCTG
GGGAACACTT CCTCCAGCAG GCCACCGGCA CGGCGCTGCT AG
 
Protein sequence
MSARLVAVAP PLLARWREAR QRRLWAAADH PAPPGATTPL VIDPDLLVAA DLRNRVAGDP 
AFDLWTARRQ WVDATFAAVR ASRAALGGAA PADVFDRVVA DVVGAPLSEL TELDQRRRRA
SSIDAELDAL HLLPAELARL VRLREMAATG IVTDDEWREL DHLLTQVRKR QRRADWLAAE
AAIALTPEHF VLSTGPWSPV AWRASAVERA DWLDRLQARM DQESAVREAL RTAVSEVEQA
ALPRCRDGLI SLLGTTGPAE EADRLTELLL VDFSGGGTEL TTRVDQAIET VQLLFLGLRS
GRLPSAHPAL AWTIEPGGAA SLDEEWAWMG GYDSWRAALF LFTYPQDLLA PSVRDTSAAV
PETARPTPPF RTLVTALRGQ QRPRRDIVLT AANEFLATIR PTLVNFPIRP LPYTAGDEGL
RDGLLAFGYT DRPTAADLAR IRDLSARVLI TLAVGHPGLP NYLQELFYYV PLLVAQNLQR
GGDYVAALDW YRTIYAFDQT DLPDFFVPSD ERKIYYGLQQ EGNAPDALSR GIHWLRDEVN
PHALAGQWSN PYTRYTFTSI ARCFLEYGDF EFTRDTGESR ARARSLYLTA RSLLRAPELA
NLAGPGLGVA VPSPVLTALL TRVDNQLRKL RQGRNIAGLR RPVEPPDAAA STSAGMPYIG
AGGQLVIPAL NPPRPTPYHF TVLLERARQL AATAAQVEAS YLSALEKRDA EAYGRLTAGL
DLDVARAGEA LQSLRVSEAQ KGTELARRQK AASDVRATTF QQWIDDGPNE WERSLVRDYD
EARVYRDWMV GLDAAITAAQ AVASVSSIPG AAAASTVGGL AVGRAVNATS LNRTEQDIAL
NTLRASQERR QDEWELQLAT ATQDGLVAQE AILLALDHEA IVGQEARIAG LQAFEAQSVT
DFLTRKFTSA ELYEWMSGVL GNTSSSRPPA RRC