Gene Franean1_5483 details

Gene Information

Locus tag: Franean1_5483
Symbol:
ID: 5673814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6633043
End bp: 6636342
Gene Length: 3300 bp
Protein Length: 1099 aa
Translation table: 11
GC content: 76%
IMG OID: 641244338
Product: hypothetical protein
Protein accession: YP_001509744
Protein GI: 158317236
COG category: [S] Function unknown
COG ID: [COG3002] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
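
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6633043, 6636342)
print(length)                           # 3300
print(expected_protein_length(length))  # 1099
```

Both values agree with the Gene Length and Protein Length fields in the record.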


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.688472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.967507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTACG CCGCGGACCA GATCGAGATG ATCACCCGGA TCGTGGAGGA GGCGGGCGAG 
CTCCTGCCGC CGCAGGCCCC ACTGGGGTAC TTCTCCCATC ACAACCCGCT GCACGCCCTG
GAGGAGCTCC CGTTCCAGCG CGCGGTCGAG CACGCCTCGG CGATGCTGGG CACGGAGGCG
CTGCAGACCG AGGAGGCCTT CGCCGCGCAT CTCGCGTCCG GCCGGATCCT TCCCCGCGAT
CTCGCCGCGG TCCTCGAGCA TCACGGCGAC CGGGCGGGCG CTGTCCCCGG CCGGCATCTG
CCGGTGGGAG ACCAGGGCCC GCGCGGGGCC GGGGAGGAAA TGCTCGACGG TGACGCCGAG
GTCGTCCCCG GCGGGCCGAC GTGGAACGAG TTCCGGCTCG CCCGGCTCGG GCTGTTCATC
GACGTGCCGC GTGGCGCCGG GGCGTTGTGG GCGCTGGCCG ACGGCGGTGA GCTGCACCGC
GTCCACCCGC TGGTCACCGA GGCGCGGCGG GAGGAGCTGA CCCGCCAGGG CCGGCGCCGT
TTCGCGACCA CGGAGCGGCG CACCCGGCGC ACCCGGCGCA GCCGGGCGGC CCGGCTGCAG
GCGCTGCGGG CCCGGCTGCT CGCGCAACTG TGGGAGGACC TGCGCCGGCA CGCGCCGCCG
CCGGCGCCGC GGCCGGCGCC GCTGCGCCGC CGCGACCAGG TGCTCGAACA GTTCGGCGTC
GACACCGACG AGGCCGTCCA CCCGGTGCTG ATCCGGCTCT GCGCCGCCTT CCTCGACCAG
GGAGTCGCCG CCTGGGAGAT GCCGCACCGG GAGAAGGGCC TGCTGGCCGC CTTCCGGCAC
CTGTTCGGGA CACTCGGCGC CCCGCGGGAG GCGTGCTGGG CCGGCCTCGG CGGCCAGCTG
CGCCAGCAGC TCCGGATGAA CTGGTCCGCC GAGCGGACGG TCGCCTGGGC GCTGTGGGCG
CTGCAGGTGC CGGTGCACGC CTGGGCGGAC ACGGTGCGTG CGGCGCTGGT CTCGCTACGC
GGCTGGGCCG GGATGGTCCA CCAGTTCGAG TGCCGTCCCG ACCGGGCGCC GTCGCGCCCG
GCACCCGCCC GGCTCATGGA CTACCTCGCC GTCCAGCTCA CGCTCGAGGT CGTCGTCTCC
CACAACGTCC TGGCGCGGCT GATCGGCCCC GACGCGCGGC CGGAGGACCT CGGCCCGCTC
GGCCCGCCAG GCGCGGCCCA GGGTGTCCTC GCGACCGGCA CCACCGCCCA GGACGATCTC
ACCCAGGGTG AGCAGGAACT TCGCGGGGGC GACCTGGAGC TTGCCTACGA GGCGTTCGTC
CTCGCGCAGG TGATGGACGT CGAGACCGAG GTGCTCGGTC ATCCCCGGTG GGCGCGGGCC
TGGCTGCGGG CCGTCGCCGA GTTCGACGCG GGCCGGCGGC GCTGGCTGCT GCACCTCGCC
TACGAGCGCC GCTACCGGAC CCAGGTGCTC GACGCGCTGT CGGCGCACGA CCGGCGCTTC
CCCGGGACGG TGCCGCCCCC CGACTTCCAG GCCGTGTTCT GCATGGACGA GCGGGAGGAG
TCGCTGCGCC GGCACCTGGA GGAGAGCCAT CCCCAGGTCC GGACCTACGG CGCCTCGGGG
TACTTCGGGG TAGCCATGGC CTACCAGGGC CTCGACGACG TCCGCCCGCG TGCGCTGTGC
CCGGTCACGA TGACGCCGCG GAGCCTGGTC GTCGAGCGTG CGGTGGACGA CGGCGAGCTG
GTCGCCTACC AGCGGGCCCG GCGCAGGAAG GCCCAGCTCC AGCACACGAT CTCGGCGGCC
CGCGGCCGTC CGGCGCGGGC GGCGGCGTAC TCCGCCATCG CCGGGCTGGC CGAGCTGGTG
CCGCTGGCCG CCCGCGCGGT CGCACCCAGG GCCGCTGGCG AGGGCGTGCG GATGCTCGGC
CGCCGGGAGC CGGCGCGCCC GCTCGCCCGG CTCGTCATCG AGGCGGCGCA GGACCACACG
CCGTCCACCG GACCGGCGGG GGATGGCGCG CAGGCGGGGG AGGCGGGGGA GCTCGTGCCG
GGCGTGGGGG CGGGGCCGCT GCGCCTCGGG TTCACCGTCG AGGAGATGGC CGAGATCGTC
GACACGCTGC TCACGACGAT CGGCATGAGC GGCCCGCTCG GCCCGGTCGT GTTCGTGATC
GGGCACGGTT CGTCCAGCGT CAACAACCCG CACGCGGCGG CCTACGACTG CGGCGCCACC
GGCGGCGGCC AGAGCGGCCC GAACGCGCGC GCGTTCGCCG CCATGGCCAA CCATCCGCGG
GTGCGCGCCG CCCTGGCCCA CCGCGGCCGC CTGATCGGCC CGGACACCTG GTTCGTCGGC
GGCCACCACG ACACCTGCGA CAGCTCCCTG GCCTACTACG ACACCGACCT GGTGCCCGCG
CACCTGCGGC CGGCCCTCAC CGCAGCCACG GACGCGCTGC TGACCGCCGT CCAGCTCGAC
GCGCACGAGC GCTGCCGCCG GTTCGAGTCG GTCGGCCCGG ACGTCGCCGC CGGCACCGCC
CACGCCCACG TGCGGGGGCG CTCGGAGGAC ATCGGGCAGT CACGACCGGA GTACGGGCAC
AGCACGAACG CGACCTGCGT GATCGGCCGG CGCTCGCGGA CCCGGGGCCT GTACCTCGAC
CGCCGCTCCT TCCTGGTCTC CTACGACCCG ACCGCGGACC CCGACGGCGC CGTGCTCACC
CGGCTGCTCC TGTCGGCCGC CCCGGTCGGC GCCGGCATCA ACCTCGAGTA CTACTTCAGC
CGGATCGACC CGATCGGGTA CGGCGCCGGG TCGAAGCTGC CGCACAACAT CACCGGCCTG
GTCGGCGTGA TGGACGGGCA CGGCTCGGAC CTCCGGACCG GGATGCCCTG GCAGTCGGTC
GAGATCCACG AGCCGATGCG GCTGCTGGTG ATCGCCGAGG CGGAGCCCGA GCGGCTGGCC
CGCATCGTGC GGGAGAACCC GCCGCTGCGC GGGCTGGTGG AGGGCGGCTG GATCCAGCTC
GCCGCCTGGG ACCCGTCCGG CCCCGAGACC TACCTCTACC GCGACGGCGC CTTCGAGCAG
CACCAGCCCG AGAACCTGCG CTTCCCGGTG GTGGCCCGCT CGGAGCACTA CTACGCGGGC
CAGCGCGACC ATCTCCCGCC CGCGCACGTG CTCGCCGCCT TCGGTGAATC CCCGGACGCC
GTGATCGACC GCCCGGGCAC GGTGGCCGCG GCGGGCGCGG GAGCCGCCCA GCCCACGCGG
GACGCAATCG AGCTGCCGGA GCAGGCGAGC GGGCCGCTGC CCGCGCGGGA CGGACAGTGA
 
Protein sequence
MAYAADQIEM ITRIVEEAGE LLPPQAPLGY FSHHNPLHAL EELPFQRAVE HASAMLGTEA 
LQTEEAFAAH LASGRILPRD LAAVLEHHGD RAGAVPGRHL PVGDQGPRGA GEEMLDGDAE
VVPGGPTWNE FRLARLGLFI DVPRGAGALW ALADGGELHR VHPLVTEARR EELTRQGRRR
FATTERRTRR TRRSRAARLQ ALRARLLAQL WEDLRRHAPP PAPRPAPLRR RDQVLEQFGV
DTDEAVHPVL IRLCAAFLDQ GVAAWEMPHR EKGLLAAFRH LFGTLGAPRE ACWAGLGGQL
RQQLRMNWSA ERTVAWALWA LQVPVHAWAD TVRAALVSLR GWAGMVHQFE CRPDRAPSRP
APARLMDYLA VQLTLEVVVS HNVLARLIGP DARPEDLGPL GPPGAAQGVL ATGTTAQDDL
TQGEQELRGG DLELAYEAFV LAQVMDVETE VLGHPRWARA WLRAVAEFDA GRRRWLLHLA
YERRYRTQVL DALSAHDRRF PGTVPPPDFQ AVFCMDEREE SLRRHLEESH PQVRTYGASG
YFGVAMAYQG LDDVRPRALC PVTMTPRSLV VERAVDDGEL VAYQRARRRK AQLQHTISAA
RGRPARAAAY SAIAGLAELV PLAARAVAPR AAGEGVRMLG RREPARPLAR LVIEAAQDHT
PSTGPAGDGA QAGEAGELVP GVGAGPLRLG FTVEEMAEIV DTLLTTIGMS GPLGPVVFVI
GHGSSSVNNP HAAAYDCGAT GGGQSGPNAR AFAAMANHPR VRAALAHRGR LIGPDTWFVG
GHHDTCDSSL AYYDTDLVPA HLRPALTAAT DALLTAVQLD AHERCRRFES VGPDVAAGTA
HAHVRGRSED IGQSRPEYGH STNATCVIGR RSRTRGLYLD RRSFLVSYDP TADPDGAVLT
RLLLSAAPVG AGINLEYYFS RIDPIGYGAG SKLPHNITGL VGVMDGHGSD LRTGMPWQSV
EIHEPMRLLV IAEAEPERLA RIVRENPPLR GLVEGGWIQL AAWDPSGPET YLYRDGAFEQ
HQPENLRFPV VARSEHYYAG QRDHLPPAHV LAAFGESPDA VIDRPGTVAA AGAGAAQPTR
DAIELPEQAS GPLPARDGQ
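
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `join_blocks` and `gc_fraction` are hypothetical helper names, and the example uses only the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = join_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGCCTACG CCGCGGACCA GATCGAGATG ATCACCCGGA TCGTGGAGGA GGCGGGCGAG"
print(len(join_blocks(first_line)))  # 60
```

Passing the complete gene sequence to `gc_fraction` should give a value that can be compared against the GC content field of the record.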