Gene Franean1_5301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5301 
Symbol 
ID5673635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6378089 
End bp6382027 
Gene Length3939 bp 
Protein Length1312 aa 
Translation table11 
GC content74% 
IMG OID641244158 
Producthypothetical protein 
Protein accessionYP_001509565 
Protein GI158317057 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.437695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCCG GCTTCGTCCT GCTGTTCCTC CTCCAGGCAC CGGGGAAGCT GACGGCCGAC 
ACCAAGCTCG ACGTCCCGCT CGAGCCGTGG CGGTTCATGT CGGCGGCCAC GCACCTGTGG
AACTCGACGT CCGACTTCGG CTTCCTGCCG AACCAGTACG CCGGCTACCT GTTCCCGATG
GGCCCCTTCT TCGGGCTGGG GAACCTGCTC GGCGTGCCGC CGTGGATCAC CCAGCGGCTG
TGGATGGCGG TGCTGCTCAC CACCGCCGCC TGGGGCACGG TCCGCCTCGC CGAGGCGCTC
GGGATCGGTC GGCCGAGCGC CCGGTTCCTG GCCGGCCTGA GCTACGCCCT GTCGCCCATG
TTCCTCGGCA AGGTCGGCGC GACGTCCGTC GCGATGGTCG GGGCGGCGAT GCTGCCCTGG
ATCACCCTGC CGCTGATCCT CGCGCTGCGC CCCGACGGCG CCGGCGGGGC GGACACGGGC
CACCGTGACG ACACCGGGGA GCGGGCCCGG GAGCTGGCCG CCGCGCGTCT GTCGCCGCGG
CGCGCCGCGG CCCTGTCGGG GCTGGCGGTG CTGTGCACCG GTGGCATCAA CGCGACCGTC
ACGCTGTGTG TGCTGCTCTG CCCGGCCGTC GTCCTGGTGT TCGCGGGCGC GACACGGCGG
GCCTGGGCAC TGCGGGCATG GTGGTGCGTG TGCGTGGTGC TGGCGTGCGC CTGGTGGATG
CTCGCGCTGG CCGTCCAGGG CCGCTACGGG CTGAACTTCC TCCCCTTCAC CGAGACCGCC
GACACCACGA CGGGAACCAC CTCGGTCGGC GAGACGCTCC GCGGCGCCGC GGACTGGATG
GCCTACCTCT CGCTGCCGAC CCCCTGGCTG CCGGCGGCCC AGGAGTACGT GAGCACCCCG
CTGGCGGTCG TCGCCTCGGC GGTGGTGTCC GCCTTCGGTC TCGCCGGCCT GGTCCGCCGG
GATCTCCCAG CCCGCCGCTT CCTGCTCGTC ACGCTCGCCG TCGGGGTGGT GTCGGTGGCG
GCGGCCTACC CGGGCCAGCC GGGCAGCCCG CTCGCGGACG GCGTCCGCTC GCTGCTGACC
GAGCCGTTCG GGTTCCTGCG CAATGTCTAC AAGTTCCAGC CGGTCGTCCG GCTGCCGCTG
ACGCTGGGCC TCGCGCACCT GCTCGGCGTG GCCCTCTCGT GGCGTCCCGG CGCGGTCCGG
GCCGGCGCCG CCGGCCCGCC CGAGGACGCC GGGCCCGCGG GCAGGGCACG GCGCGGCGCG
GAGGGCGCCG GCGATGATCG GCGCAGGCTC GTCCCGGTGC TGGTCATCCT GGTGACGGTC
GGGACGCTGG TCGCCGGGAT GGCGCCGATG CTGCGTGGCC AGGGCCTGCA GCCCCGCCCC
TTCACGAAGG TGCCGGACTA CTGGGCCCAG GCGGCCGACT GGCTCGCGGA CCATCCCGAG
GGCGGGCGGG CCCTCGTCCT GCCGGGCGCC CCGTTCGGCG AGTACCAGTG GGGCCGCCCC
CTCGACGAGC CGCTGCAGTG GCTTGCCCGC ACCCCCTGGG GGGTGCGCAA CATCATCCCG
CTGGGCGGTG TCGGGACGAC CCGCCTGATG GACGGCATCG AGCACATGAT GGCCACCGGC
TCCACCCCCG GGCTCGGGGT GACGCTGGCC CGCGCGGGGG TCGGCCAGGT GCTCGTGCGT
AACGACATCG AGCAGAAGGA CTGGGACATC CCGCCGTCGA CGGACCAGCT ACACCGCTCG
CTGGAGAGCT CGGGCCTGGT CCGGGCGGCG TCCTTCGGGC CCGAGGTCCA GGCCCGCACC
GCCGCCAAGG CCCGGCTCGT CGAATCCCTG AGCGAGTCCG CCGAGAAGGT CCCGGCGATC
GAGATCTGGA CGGTGCCCGG GGGAGCCAGC ATGGTCCAGG CCTACCCGGT CGACACCGGG
GTCGTCGTCT CCGGCGGCCC CGAGGCCACC GTGCAGCTCG CCGGCCAGGG GCTGCTCAGC
GCCGATCGAG CCGTGGTGCT GGCCGGGGAC CTCGCCGAGC CGGACCGCCC CGCCGGCGGC
GCGGCGGCCG GCGCGGCCGA CGACACCGCG ATCACGCGGG CGCCCATCGC CGATCCTCCG
CCGGCGTCCC AGGTCGTCAC GCCCACGACG GCCTGGGCCG TCACCGACAC CAACACCCGC
CGTGGCTACA CCTTCGGCAT CGTGCACGAC TCGGCGTCCT ACCTGCTCGG CCCGGACGAG
ACCGTCGCCG GCCGGCCCGG CCCGCCGAGC CAGTGGGTCG ACCGGCCGGT GGTCGGTCAC
CAGACCGTCG CCGGCTACGC CGACGGGATG TCCGTGCAGG CCTCCTCCTA CGGCTACGAC
CTGCTCGCGG CTCCTGACTT CGCGCCGTCC GCGGCCGTCG ACGGGCAGAC CTCGACCTCC
TGGACGGCGC TGCGCCGCCA GGGCGCGACC TCCCAGGGAC AGTGGATCCA GCTCGACGTC
GGCCGCGAGA TGTCCGTGCC CTACATCGAC ATCCGGCTGC TGCAGGAGGG CGACTGGCGC
CCGGAGGTCG AGGCGCTGCG GGTGACCACC GAGCGCGGGT CGGCGGTCAC CCAGGTCTCG
CCGATCGAGG ACATCCAGCG GCTGGCGGTG CCCCCGGGGA TGAGTCGCTG GTACAGGATC
ACCTTCGACA AGGTCAGCCG GGAGACCGAC CCCGTCCTCG GGGCCGGCCT GCGCGAGATC
GAGATCCCCG GCGTGCGGTT CCAGCGGTAC GCCCAGGCCC CGGCCGACAT GGTCGACGAG
TTCCAGGCGC CGGACGAGGG GCTGGTCGCC TACTCCTTCG AGCGGACCAG GGTCGACCCG
CTCCAGCCCT TCGGCGGATC CGAGGAGATC ACGCTCTCCC GGCGGTTCGA GGTGCCCCGC
CGCCTCACCT TCACCCTCAC CGGCACCGCG AGCGCCCTCC CGCCGCCGGC CGGCGCCGAA
GTCGACTCCT CCGACGATCC GCTGGTCATC CCATGTGGCC AGGGGCCGGC CCTGACGATC
GACGGAGTCC GCCACGACAT CCAGGTCGAG GGCAAATACA GCGACCTCGC CACCGCGCGG
CCGTTCCGCA TCAGCCTGTG CTCGGAGGGC CACCAGATCA CCCTGGATCC GGGGCAGCAC
CTGATCACCG TCGACCTCGG CCAGTCGACG ATGCTCGTCG ACTCGCTCAG CCTGGTCGGC
ACCACCGCCG CGACCAGCAC GGAGAAGCCA CGGACGACCC GGATCGGGGA GTGGGGCGCC
GAGCGGCGGA CGATCGAGAT CGGTGCCGGC GCCAGATCCT TCGTGTCGGT GCGGGAGAAC
GCGAACGCGT CCTGGACGGC GACCCTGGAC GGGAAGCCGC TCACGGCGGT CCGGCTCGAC
GGCTGGGCGC AGGGCTGGAT CGTGCCGGCG GGGGCCGCCG GCACGATCGT GATCGAGAAC
CTTCCCGGCC AGGAGTACCG GCGCAACCTG ATCATCGGGC TCGCCCTGGT CGTTCTCCTG
ATCGTCCTGG CGGCCGTCCC GGGCCGGCAC CGGCTCCGGC GCCGCTCCGA CCCCGACGGG
TACCCTCTGG GCCTCGAGCC GGGGCGCGTC CCGCTGATCG GGCTGCTCAC CCGCGTTCCG
GGCGCCTGGG CGGGGATGGC GCTGGCGACA GCCGCGGTGT TCCTGATAGC GGGCTGGCTG
GCGCTCGCGG TACCGGTCCT GGTCCTCGTC GGCCGGCGGT TCCCGGTGGT GCTCGGCGTG
CTGGCGGTGG CTGGCATGGT CGGCTCCGGC ATCGCCGTCG CGGTCAGCCC CGACAGCATT
CCGTTCTCGG GCGAGGGCGC GTTCGGCTGG CAGGCCCAGA CCCTCGGATC GCTGGCGTTC
GCCGCGACGG TCGCCGCACT CGCGCTGCGC CGTGCCGAGC CGGCCCGGCC CGCATCACCC
GGACCAGCCT CGGACGGCTC AGCCGGCCAT GTCCCGTAG
 
Protein sequence
MFAGFVLLFL LQAPGKLTAD TKLDVPLEPW RFMSAATHLW NSTSDFGFLP NQYAGYLFPM 
GPFFGLGNLL GVPPWITQRL WMAVLLTTAA WGTVRLAEAL GIGRPSARFL AGLSYALSPM
FLGKVGATSV AMVGAAMLPW ITLPLILALR PDGAGGADTG HRDDTGERAR ELAAARLSPR
RAAALSGLAV LCTGGINATV TLCVLLCPAV VLVFAGATRR AWALRAWWCV CVVLACAWWM
LALAVQGRYG LNFLPFTETA DTTTGTTSVG ETLRGAADWM AYLSLPTPWL PAAQEYVSTP
LAVVASAVVS AFGLAGLVRR DLPARRFLLV TLAVGVVSVA AAYPGQPGSP LADGVRSLLT
EPFGFLRNVY KFQPVVRLPL TLGLAHLLGV ALSWRPGAVR AGAAGPPEDA GPAGRARRGA
EGAGDDRRRL VPVLVILVTV GTLVAGMAPM LRGQGLQPRP FTKVPDYWAQ AADWLADHPE
GGRALVLPGA PFGEYQWGRP LDEPLQWLAR TPWGVRNIIP LGGVGTTRLM DGIEHMMATG
STPGLGVTLA RAGVGQVLVR NDIEQKDWDI PPSTDQLHRS LESSGLVRAA SFGPEVQART
AAKARLVESL SESAEKVPAI EIWTVPGGAS MVQAYPVDTG VVVSGGPEAT VQLAGQGLLS
ADRAVVLAGD LAEPDRPAGG AAAGAADDTA ITRAPIADPP PASQVVTPTT AWAVTDTNTR
RGYTFGIVHD SASYLLGPDE TVAGRPGPPS QWVDRPVVGH QTVAGYADGM SVQASSYGYD
LLAAPDFAPS AAVDGQTSTS WTALRRQGAT SQGQWIQLDV GREMSVPYID IRLLQEGDWR
PEVEALRVTT ERGSAVTQVS PIEDIQRLAV PPGMSRWYRI TFDKVSRETD PVLGAGLREI
EIPGVRFQRY AQAPADMVDE FQAPDEGLVA YSFERTRVDP LQPFGGSEEI TLSRRFEVPR
RLTFTLTGTA SALPPPAGAE VDSSDDPLVI PCGQGPALTI DGVRHDIQVE GKYSDLATAR
PFRISLCSEG HQITLDPGQH LITVDLGQST MLVDSLSLVG TTAATSTEKP RTTRIGEWGA
ERRTIEIGAG ARSFVSVREN ANASWTATLD GKPLTAVRLD GWAQGWIVPA GAAGTIVIEN
LPGQEYRRNL IIGLALVVLL IVLAAVPGRH RLRRRSDPDG YPLGLEPGRV PLIGLLTRVP
GAWAGMALAT AAVFLIAGWL ALAVPVLVLV GRRFPVVLGV LAVAGMVGSG IAVAVSPDSI
PFSGEGAFGW QAQTLGSLAF AATVAALALR RAEPARPASP GPASDGSAGH VP