Gene Franean1_2780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2780 
Symbol 
ID5671169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3287231 
End bp3290674 
Gene Length3444 bp 
Protein Length1147 aa 
Translation table11 
GC content75% 
IMG OID641241689 
Producthypothetical protein 
Protein accessionYP_001507109 
Protein GI158314601 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCTTG ACGACCTGAC CTGGCGGATC CGCGACCGGA TCGAGCAGTA CGACCGGACG 
AGGCAGGTCG CGGAGATCCT CGACGCCGAC GCCGACCGGG ACGCCGCCGC CCTGTGGCGG
CTGCTCCACG CCGTCGACCT CACCACCTCC TCCCCCGCGC TGGACGCGGC GTTCTCCATC
GCCAGCGCCA CCCTGGGCCG GTTGCACCAC CGCCGTTACC AGCTGCTGCC CGTGGGCGCC
GGTCTGTCCG AGCTGGCCCG GTGCCTGCTG TGCCTGGAAC CGATCTCCGA CGATCACGAG
GCCGTCCCGT CCGAGCTCGT GCCCGTGGTC GGGCGGTTCA CAGATCCCGA CGTCCAGGCC
GCGCTGGGCG TGCAGCTGCT CGATGCCGCG GCCGGAGGCG AGGATCCCGC CCTGCTCGAC
GCGGCCATCC TGCTTCTGGC CCCTGCCGCG ACCGCGCGGC CGCGGCAGGG TCAGGGGCCG
CGGCGGGGGC CCGGTCGGAG CGGCCGGCTC TCGGCGCTGT CCACCGCGTA CCGCCGACGG
CACGAACGCG ACGGCACCAC CACCGACCTC GACCGGGCCC TCGACACCGG CGAACGGGCC
GTGAGGCTCG CGGACCAGGA TGGCGGGGCG CCGGAGGTGT CCGTCCAGGC GTGGACCGCC
CTCGCCCGGG CCTACCGCTG CCGGTACCGG CTGCACGCCG ATCCCGCGGA CCTGCAGCGC
GTCATCGACC TGTCCGAGCG GGCCCTGGCG CATACCGGCC CCTCGGCGAA CCAGCTCGCG
GACCTGGCCA CCGCCTATCT GCACCGGCAC GAGCACACCG ACTCACCCGC GGACCTGGAA
CGGGCCGTGG ACCTGGCGGA GGACGCGGCC GCGCTGCCCG GCGGGCAGGA GGACCCCGAC
GTTCTGTCCG CCCTCGGCCG CGCGCTGTTG CGCCACTATG ACCGGTCCGG GCAACGCTCC
GAGCTGTGGC GGGCGGCCAC CCTCGCCGAG CAGGCCGCGG CTGCGCTGTC ACCACGCGAC
CCGCGGCGCG CCACCTACCT GTGCGCCGCC GCCGCGACCC TGCTTCGGCG GCACGAACGC
AGCGGGGCAC TCGGCGACCT GAACCGCGCC GTCGACCTCG GCCGGCAGGC CCTCGCGGCG
ATGCCGGAGA CCGACCCCGC CCGGGCAGAC GCCCTCGGCC GGCTCGCCGC CGCGCTGCAC
CGGCGTCATC GCAGCGCCGG CGCGGACACC GACCTCGACC AGGCCGAGGA CCTGGCGAGC
TGGGCCCTGG CAGCGATCCC GCCCGGGCAT CCGGACCGGG CCGGTGCGGC CCTGGAACGC
GCGGCCGTCC ACCTGACCCG CTACCGCCAC AGCGGCGTGA CCGCCGAGCT CGCGCGTGCG
ATCGAACTCG GCGAGCAGGT CACGGCGACG GACAGCACCT CCCTGCCGGG ATGGTGGTCG
CTTCTGGGTG ACGCCTACCA GCAGCGTCAT GCGATCAGCG GCGAGGCCAG CGACCTGGAC
CGGGCGGTGG AGCTCGGCGA GCGGGCCCTG GCGGCCACCC GCGAGGACGA CGTGGCACGC
GCCGAGCGGT ACGCCCGGCT GGCCACCGCG CACTGGCGCC GGCGCAGCCA CACGCCGGGT
GGCGCCGACC TGGACCGGGC GATCGACCTG AGGGAACGGG CCGTCGCCGG CACCCCCGCC
GACCACCTGG ACCTGCCGGA TCGACTGGCC GACCTCGCCG CCGCCCACCT CGACCGCTAC
CGTCTCACCG GCGCGGCCGC CGACCTCGAC ACCACTGTCA CTCTGTGCGA ACGGGCTCTG
GTGGCGCTCC CGGTCGACCA TCCACACCGC TCCCGGTTCA CCGCCAGCAT GTGCGTCGCC
TACCTGCAAC GGATCGCCGG CGCGGGCCAG GCCCCGGACC GGTCACGACT GCGGGAGCTC
GCCGACGGGA TGACCGGCGC CCAGGGCGCC GCCCCCGCCG ACCGGGTGTC TGCCCACCAT
GCCGTGGGCC GGCTCGCGCA GAGCGCCGGG CAGCCGGCGC TCGCCCTCGC GATGCTGGAC
GCGGCCGCCG CTCTCCTGCC GTCGGTGGCT CCCCGCGAGG CGGGCTGGGC CGACCAGCAG
TACCGGCTCG GTGAACACGG CGGCCTCGTC GGAGCCGGGG TGGCCGCGCA CTGCGCGGCC
GGTGACCCGG CGGGCGCCGT CGAGTTCGCC GAACTCGGCC GCGGGGTGCT CCTGGCGAGC
CAGGCCAACA CCCGGGCCGA CCTCGACGAG CTCGACGACC GGGCACCACG GCTCGCCGCC
CGCTTCCGCT GGGTCTGCGA GCGGCTCAAC ACCCCCGACT TCCCCGCCGA CGAACGCCGC
CGATGGTGGG CCGACTACGA CCGGCTCCTC GTCGACATCC GTGCGGTCCC CGGCCTCATG
CACTTCGTCG CGGCGCCGCA ACTGGCGGAG CTGGCCCCCG CCGCCGCGGG TGGGTGCGTG
ATCCTCGTCA ACGCCGATAC GCACCGAAGC GACGCCATCC TCGTGCGGGC CGACACCGAC
CCCGTGTCCG TCGCGCTGCC CGACCTGCGG CCGTCCGACG TCAACAAGCA GGTCACCGCC
TTGCTCGCCG CCCTCAACAG CGGCTCCACC CTGGCCGGGG CACTGCGTCG GCGTCTGGTG
GTGACCGCGG TGCTGGGCTG GTTGTGGGAC GTCGTCGTGG CCCCGGTCGC CGCCGCCCTG
CCTCCCGGCG ACACTGCCCA GCGGGTGTGG TGGCTGCCAA CCGGGCTCCT CGGACTGCTG
CCGTTGCACG CCGCCGGCCA CCCCGGCCAG GACGGCGCTC TCGACACCAT GATCTCCTCC
TACATCCCCT CGCTGCGGGC ACTGCGGGCC GCCCGCAGCC GCCCGCCGGC CCGACGGCGC
CAGAACCTGT CCGTCGTCAT GAGCGCCACC CCGGACATGC CGGAGCTACC CGGCGCCGAA
AAGGAGGCGG CCGTGGTGGA CGGCCCGTCC CTGCTCAACG CGGACGCGAC CGCAGATCAA
GTCCTGACCG CGCTACGGCA GACAACCTGG GCGCATTTCG CCTGCCACGG CGTGATCAAC
GCGACCTCGC AGGTCGACAG CGGTCTGCGG GTGCACGACC GCATCCTGAC ACTGCCCGAG
ATCGGCGGTC TGCGGCTGAC CGACGCCGAA CTCGCCTACC TGTCCGCCTG CTCCACCGCC
AACCACGGCA CCCGCTACGC CGACGAGGTG CTGCACCCGG CCGCCGCCTT CCAGCTCGCC
GGTTTCCGGC ATGTGGTGGC CAGCCTGTGG CCACTCGCCG ACGGTGACGC CGTGGACGCC
GCCCGCGCGT TCTACCAGCA TTTCGCCGAC ACTCCGGTCG CCGACCAGGC AGCCCCCGTG
CTGCATACCG TCACCCTGCG TCTACGGGAC CAGTATCCAG AACGCCCCGA CCTGTGGGCA
CCACTCGTCC ACAGCGGCCC CTGA
 
Protein sequence
MPLDDLTWRI RDRIEQYDRT RQVAEILDAD ADRDAAALWR LLHAVDLTTS SPALDAAFSI 
ASATLGRLHH RRYQLLPVGA GLSELARCLL CLEPISDDHE AVPSELVPVV GRFTDPDVQA
ALGVQLLDAA AGGEDPALLD AAILLLAPAA TARPRQGQGP RRGPGRSGRL SALSTAYRRR
HERDGTTTDL DRALDTGERA VRLADQDGGA PEVSVQAWTA LARAYRCRYR LHADPADLQR
VIDLSERALA HTGPSANQLA DLATAYLHRH EHTDSPADLE RAVDLAEDAA ALPGGQEDPD
VLSALGRALL RHYDRSGQRS ELWRAATLAE QAAAALSPRD PRRATYLCAA AATLLRRHER
SGALGDLNRA VDLGRQALAA MPETDPARAD ALGRLAAALH RRHRSAGADT DLDQAEDLAS
WALAAIPPGH PDRAGAALER AAVHLTRYRH SGVTAELARA IELGEQVTAT DSTSLPGWWS
LLGDAYQQRH AISGEASDLD RAVELGERAL AATREDDVAR AERYARLATA HWRRRSHTPG
GADLDRAIDL RERAVAGTPA DHLDLPDRLA DLAAAHLDRY RLTGAAADLD TTVTLCERAL
VALPVDHPHR SRFTASMCVA YLQRIAGAGQ APDRSRLREL ADGMTGAQGA APADRVSAHH
AVGRLAQSAG QPALALAMLD AAAALLPSVA PREAGWADQQ YRLGEHGGLV GAGVAAHCAA
GDPAGAVEFA ELGRGVLLAS QANTRADLDE LDDRAPRLAA RFRWVCERLN TPDFPADERR
RWWADYDRLL VDIRAVPGLM HFVAAPQLAE LAPAAAGGCV ILVNADTHRS DAILVRADTD
PVSVALPDLR PSDVNKQVTA LLAALNSGST LAGALRRRLV VTAVLGWLWD VVVAPVAAAL
PPGDTAQRVW WLPTGLLGLL PLHAAGHPGQ DGALDTMISS YIPSLRALRA ARSRPPARRR
QNLSVVMSAT PDMPELPGAE KEAAVVDGPS LLNADATADQ VLTALRQTTW AHFACHGVIN
ATSQVDSGLR VHDRILTLPE IGGLRLTDAE LAYLSACSTA NHGTRYADEV LHPAAAFQLA
GFRHVVASLW PLADGDAVDA ARAFYQHFAD TPVADQAAPV LHTVTLRLRD QYPERPDLWA
PLVHSGP