Gene Franean1_5349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5349 
Symbol 
ID5673683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6448589 
End bp6452914 
Gene Length4326 bp 
Protein Length1441 aa 
Translation table11 
GC content73% 
IMG OID641244207 
Producthypothetical protein 
Protein accessionYP_001509613 
Protein GI158317105 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.598215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGCAA CGCACCGCGG ACTGGCTGGC CGAGCATGCG ACCGGCCGGG TGGTCTTGGC 
CGGGTAGCAC CGCCCATCGC TGGCCGAACG AAACACCGCC CCACCGGACC TTCCAGGGCG
GGGCTGTTCA CTGATCATCG ACTGTACGAT GACTCGCCCG GCACGGCGTT CGTGGGGACG
GGGGAGTCGC CGATGGTACT GATGCCGTGG GAGCGTGAGC TGCTGGCCAG GCGGCAGGAC
CGCGCCCTCG AACTGGCCGT GCGTGCGAAC GCGCTGTTGG ACGCGGCACG TCTCACGGGC
GACGAAGCCG CGCTGAACAC GGCGATCGCC CTGTTCCGCC GCGCCAACGG GACCATCCCC
ACCGGTCATC CCGGCCAGGC CGCGGTCAGC CACAGCCTCG GCAACGCCCT GCAGACCCGC
TTCGGTTGGC ACGGGCAGGG CCGGGACCTG CAGGAGGCCG TCGACCTGCA TACCGGAGCC
GTGGCTGCCA CCGCGCACGA CGATCCGGCC CGGCCCGTGG CACTGGCGAG CCTCGGCGCG
GCCCTGCAGA CCCGTTTCGA ACGGATCGGC GATCTCGCCG ATCTCGACGC CAGCATCGAC
CTGCAGCGAG AGGCGGTCGC CGCAGCGACC GATCCGGATC ACGCTTCCGC GCGGAGTCGC
CGTCCGGCCG TATTGCCCGA CACCGCGATT CTGTGGTCCA ACCTCGGCAA CGCGCTGCGG
CTCCGCTTCA CCTGGACAGG CCGCGAGGAC GACCTTGACG CAGCGCTCGG CGCCGGCTGG
ACGTCGCTGG CGGAGCCGGC GTCCCGCACC ACCGACCACG CCCGGCACCT GTCCAACCTC
GGCAACACCC TGGCGACCCT GTTCGAGGCG CGGGGCCGCG CGGCGAACCT GGACGAGGCG
ATCACCTGCT TCCGCGACGC CGTCGCCGCC GTCTCCGCGG ATCATCCCGA CGCGGCCGGA
TATCTGGCCA ATCTGTGTGG CGCGCTGTGG ACGCGGGCCC GTCACACCGG ACGCCAGACC
GACCTCAACG AGGCACTGGA GCAGGGCCGA CGAGCGAAGG CCGCCATCCC GCCAGACCAC
CCCGCCAGGC CCCGAATACT GGCGGACCTC AACGGCGCGC TGCAGACCCA CTTCGAGTGG
ACGGGCAGCG ACACCGACCT GGACGACGCG GTCGCCGTCA GCCGTGACGC CGTCGCCGCG
GCTCCGCCGG GCCATCCCGA CCGGGCCGGG CACCTGTCCA ACCTCGGCAT CGCGCTGCAG
CGCCGGTTCG AACACCGAGG CGACCCGGCT GACCTCGACG AGTCGGTCGA CCTCCTGCGC
GAGGCCGTGG CGGCGACCCC CGACGGACAT CTCAAAGCCA CCCCCCACCT GGCGAATCTC
GCCAACCTCC TGCGTCGTCG GTTCGAGCTG ACAGGTGGCA GGGAGGATCT CGACGCCGCG
GTCGGCCTGC TGCGCCGGGC GGTCGCCTCA TCGTCGGAAG GCAGCTCCGG CCACGCCGGG
CTGCTGCTGA ACCTGAGCAT CGCCGAGCAG GTCCGGATCG CCTCCCAGGG GCGCGACGGA
GACCTGGCCG GCCTCGACCA GGCGATCGCA CTCGTGCGCA CGGCACTCGC CGTCACGCCG
CCGGACCGCC CGGATCACGC CCGGCTGCAG TCGACACTCG GCCTGGCCCT GCGGGCCCGC
TTCGAGTGGA CCGGGCGGCA AACCGATGTC GCCGAGGCGA TCGGCGCCCT GGAGACCGCG
GTGGCGGCGA CCCCCTCGAC GCACCCCGCC CGGGCCAGCT ATCTGTTCAA CCTCGGCTGC
GCGCTGCAGA CCCGCTTCGA GCGCACCGGA CTGGCCACCG ACCTGGAACA GGCCGTGGCC
GCGATGGAGA CGGCGGCCGC AGCCAGCCCC GCGCAAGGCG CGGACCGCGG CCGGATCCTC
TCCGGTCTCG GCCTCGTGCT GCAGCACCAC GCCCAGCAGT CAGGCGACCG CGCCCAACTG
GAGAGGTCGA TCGAGGCTCT GAGGTCAGCG GCCGCCGCGA CCCCACGGAC AGGCCCCGAG
GCGGCCGGAG TCCTGTCGAA CCTCAGCATC TCGCTGTGGC GGCGCTCGGT CTGGGACGAG
GATGGCAACG CCGACCTGGA CGAGGCTGCC GCGGCCGGCC GCGCAGCGGT GGACGCGGTG
GTCACCGACC ACCCGCTGCA TGCCCGGTAC CTGTCGAATC TCAGCAACCT GCTGCAGACC
CGTTTCGAGC GTTGCGGCGC CCCGGCCGAC CTCGACGACG CCATCGACCA TTGCCGGGCG
GCGCTGGCGG CCACCCCGAC CGACCACCCG GACACCGCCC GTTACCTGGC CAGTCTGGGA
CTCGCGCTGG GTCTGCGGTA CGCGCGCATG AAGCAGCCGG ACGACCTGGA TGCGGCGGTC
GGCGCCGGGC GCCGGGCGGC GGCGGTCGAG GCCGCGTCGC CGCGGGTCCG CGCGCACGCC
GCGCGTGGGT GGGCGATGAC CGCCGCCGCC GCAGGGCGGT GGGCCGAGGC CGCGGACGGC
TTCACCGCCG CGATCGACCT GCTGGGACAG GTCGCGCCCC GCGGGCTGGC CCGCGGGGAC
CAGGAGGGCC TGCTCGACGA GCAGGCCGAT CTGGGCTCCG ATGCGGCGGC CTGCTGCGTC
CGCGTCGGTC ATCCCGCTCG GGCGGTGGAA CTCCTGGAAC AGGGCCGGGG AGTGCTGCTC
GCACAGGCCT TGGACAGCCG CACCGACCTG GGCGCGCTCA GGGAGATCGA CCCGGCGCTG
GCGGAGCGGT TCGCGCAGTT GTGCGCGCGC CTCGACCGGC CCGGCCCACT GGACGCGGGC
TTCGCCAACA TCGCTGGCGC ACTGGGCGGA GCCATGGGCC TTCCCGGTGC CGGGGCTGGC
GTCGATTCCG GGGCTGGGGC TGGGGCTGGT GGCATGGCTG GGTCGGCTGC CGGTGCCGAT
CCGACGCTTC GTCTCGACCG GGTCGACGCG GAGGAACGCC GCCAGGCCGG CCTGGCGCTC
ACCCGCACCA TCGAGGAAAT CCGTGTGCTG CCCGGCTTCG ACAGCTTCCT GCGGCCTCCC
ACACTGGCCG ATCTGCGCAC TGCGGCCTCG GCCGGGCCGG TCATCCTCCC TGTGGTCTCG
TCGTTCGGCT CGTACGCGTT GATCGTCACC GAACGCGGGC TGCTGGACCC GGTGGAGCTG
CCCGACCTCA CCCCGGACGC GGTCAGGGAC CGGATCATCG CCTTCCTGAC CGCGCTCGAC
GATCCGCAGG ACCAGGAAGG CCACAAGCGG CTGGCCGAGA CCCTTGGTTG GCTGTGGGAC
GCCCTCGCCG GCCCGGTCCT CCAGCGCCTC GACGCCCTCG GATTCCTCGC CCTGACACCG
GCGGCCTCCC CCGCGTGGAA GCGCCTGCGT ACGAAGTGGC AGCCGGAGGC AACCGGCCCC
GAGCGGTCCG CGGGTCACGC GGCCTCCGCC GGGCGGCCGC GGCTGTGGTG GTGCGCGTCC
GGCATGCTGG CCTTTCTGCC CCTGCACGCG GCCGGGCACC ACGAGACCCG GCACCGCGGC
GCTCCCCACA CCGTGCTGGA CCACGCGATC AGTTCCTTCA CCCCCACGCT GCGCGCGCTC
ATCCACGCCC GCCGGCCGGT GCCTGCGCAC CCTCCGGCCA CGGGCGCTCC GGCGGTCACC
ACGGACGCGC GGGCAACCGG AACGCCGGTC ACGCCGGTCG CGCCGGTCGC GTCGGGACCG
CCGGCCGCGT CGAGACGCAT CGTGGCCGTC GGCATGCCAC GCACTCCGGG CGCCGGCGAC
CTGCCGGGCG CCGAAAAGGA GATCCACCGA CTCGAAACAC TCTTCCCGGG GCAGGTCCGC
ACGCTGATCG CGCATGAAGC GACGCACGCC GCGGTGGCCG CGGCCCTCCC GCACGCCCAC
TGGGCGCACT TCTCCTGTCA CGGCTACGCG GACCTGCGCA ACCCCTCGAA CAGCCGACTG
CTGCTCACCG ACCACGAACG CGACCCGTTC ACGGTGGTGG ATGTCGCCCG GCTGCGCCTC
GACACCGCCG TGCTCGCATT CCTGTCCGCC TGCGAGACCG GCCGACCCGG CGGCCCGGCC
GACGAGGGAA TCCACTTGGC ATCGGCGTTC CAGCTGGCCG GCTTCCGGCA GGTTATCGCC
ACCCTGTGGC CCGTCAGCGA CAGTGCCGCG GCCGAACTCG CCGAGGAGCT CTACGACGCC
CTAGCCCTGG CACCACCCGA CTCACTGGAC GTTGCCGCTG CCCTGCACGA GGTGACGCTC
GGCCTGCGCG GCGTGTGGGC CGACCAACCG GAGGTGTGGG CGTCCCACAT CCATTCCGGG
GCCTGA
 
Protein sequence
MAATHRGLAG RACDRPGGLG RVAPPIAGRT KHRPTGPSRA GLFTDHRLYD DSPGTAFVGT 
GESPMVLMPW ERELLARRQD RALELAVRAN ALLDAARLTG DEAALNTAIA LFRRANGTIP
TGHPGQAAVS HSLGNALQTR FGWHGQGRDL QEAVDLHTGA VAATAHDDPA RPVALASLGA
ALQTRFERIG DLADLDASID LQREAVAAAT DPDHASARSR RPAVLPDTAI LWSNLGNALR
LRFTWTGRED DLDAALGAGW TSLAEPASRT TDHARHLSNL GNTLATLFEA RGRAANLDEA
ITCFRDAVAA VSADHPDAAG YLANLCGALW TRARHTGRQT DLNEALEQGR RAKAAIPPDH
PARPRILADL NGALQTHFEW TGSDTDLDDA VAVSRDAVAA APPGHPDRAG HLSNLGIALQ
RRFEHRGDPA DLDESVDLLR EAVAATPDGH LKATPHLANL ANLLRRRFEL TGGREDLDAA
VGLLRRAVAS SSEGSSGHAG LLLNLSIAEQ VRIASQGRDG DLAGLDQAIA LVRTALAVTP
PDRPDHARLQ STLGLALRAR FEWTGRQTDV AEAIGALETA VAATPSTHPA RASYLFNLGC
ALQTRFERTG LATDLEQAVA AMETAAAASP AQGADRGRIL SGLGLVLQHH AQQSGDRAQL
ERSIEALRSA AAATPRTGPE AAGVLSNLSI SLWRRSVWDE DGNADLDEAA AAGRAAVDAV
VTDHPLHARY LSNLSNLLQT RFERCGAPAD LDDAIDHCRA ALAATPTDHP DTARYLASLG
LALGLRYARM KQPDDLDAAV GAGRRAAAVE AASPRVRAHA ARGWAMTAAA AGRWAEAADG
FTAAIDLLGQ VAPRGLARGD QEGLLDEQAD LGSDAAACCV RVGHPARAVE LLEQGRGVLL
AQALDSRTDL GALREIDPAL AERFAQLCAR LDRPGPLDAG FANIAGALGG AMGLPGAGAG
VDSGAGAGAG GMAGSAAGAD PTLRLDRVDA EERRQAGLAL TRTIEEIRVL PGFDSFLRPP
TLADLRTAAS AGPVILPVVS SFGSYALIVT ERGLLDPVEL PDLTPDAVRD RIIAFLTALD
DPQDQEGHKR LAETLGWLWD ALAGPVLQRL DALGFLALTP AASPAWKRLR TKWQPEATGP
ERSAGHAASA GRPRLWWCAS GMLAFLPLHA AGHHETRHRG APHTVLDHAI SSFTPTLRAL
IHARRPVPAH PPATGAPAVT TDARATGTPV TPVAPVASGP PAASRRIVAV GMPRTPGAGD
LPGAEKEIHR LETLFPGQVR TLIAHEATHA AVAAALPHAH WAHFSCHGYA DLRNPSNSRL
LLTDHERDPF TVVDVARLRL DTAVLAFLSA CETGRPGGPA DEGIHLASAF QLAGFRQVIA
TLWPVSDSAA AELAEELYDA LALAPPDSLD VAAALHEVTL GLRGVWADQP EVWASHIHSG
A