Gene Franean1_6380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6380 
Symbol 
ID5674696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7743024 
End bp7746077 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content67% 
IMG OID641245229 
Producthypothetical protein 
Protein accessionYP_001510624 
Protein GI158318116 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.726246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGCA CGGTGGTCAC CTTCTACTCG TACAAGGGTG GCGTCGGGCG TTCTCTCGCA 
CTCGCCTGGG TAGCCTGGAT CCTCGCCTCG GCCGGGAAAA AGGTCCTCGT CGTCGACTGG
GATCTGGAGG CCCCCGGCCT GGAGAACTAC TTCTGGCCGG CCGCGGCCGT CAAGGGGATG
CGGACCAGGC CCGGTCTCGT GGATCTCGTC ATCGACTACC GCGACCGCCT CGACGCGGAC
GCACTGGCAC GGTACCTGGA ACGTCATTCA TGGCAGCCGC TCGCCGCCCA CCCGGACTGG
TCCGGGGAGG ACTGGGCGGA CCCGTCGACC GCCGAGCAGG CACGCGCGGA GCTGGCCGAT
CCCGACCTGC GGGAGTGGGT CCGGGAGCGA ATCACCAGGG TGGATCTGGA TGCGGTCCGC
GCCGAGCTCA CCAGGGATTC GCCCGGCGGC GAGGCTGATT CTGACCCTAT GAAGCAGCGG
TTGAGGGACT GTACCAGCTG GGCCTCGTTC GTCGAGGAGC TGGACATCGA CGACGTTTTC
CAGCCCGGTG GCTCGGTCGA CCTCATGCGA TCCGGCCGCC TCGACGCTGA ATACGCCGCA
AAGCTGGCGC TCCTGGACTG GAACGAGCTG TACACGAAGT TCGCCGGGCA CCGCTTCTTC
CGGACCCTCA AGTCCAGATG GACGGACTAC TACGATTTCG TCCTGGTGGA TTCGCGGACG
GGCATGGGGG ATGTGTCGGC CACCTGCACG CAGGTTCTCC CCGATGTCAT GGTCGCCTGC
TTCGGTATGA ACGAGCAGGG CATCCTCAAC ACCGCACGGG TCGCCGCCCA GGCCCGTGCG
GCGGGCGCCC GTGAGCGGCG TGAACTGAGA ATCCTGCCGT TGCCGTCACG GATCGAGACC
TCCGCGCGGG CGGCGGCGGG AAAGGCAAAG GACCGCTACC AGCGGGTGTT CGAACTGCCG
TGGGAGGAGT TCGAGGAGGT CTCCACGACG CTGCCGCGTC AGCCGTTGAT AGGGCCCAGA
ATCCTGGGTG GCACGCAGAA GTACTGGGAG CAGGTGGCGT TGCCGCACAC CCCGCGGTAC
GCGTTCCAGG AGGGTCCGCC GCACGGAATC GACCGGGATC GTCTGCAGCG GGCCTGCGAA
TTTCTCGCCC AGAACATCGG GCGGGATGAC ACAATCCGAC TGAACCCGCC GTCGCGGACG
GCATGGGCGA AGATGGTGGC GGGCTTCGAC CGTCCCGCGC GCAAGCTACA GTACGACGAG
TTCGTGATCA GCTACGCCGA AGGCGGTCGG GATCCGCTCT GGGCGGCCTG GGTGTACGAA
CAGCTGTGCC GGATCTCCGC GCGGGTACGG CAGCACAACG CGAGCCGGGA CGGGACGGGC
TTCCTCGACG ACCGGCTCCG GGACGCCGGC GGCGATCCGG GTACGGACAT CGACGGCGGG
AAGCTCTGTG TGCTGATCCT GCTCAGCCCC AGATATGTCG AATCCGACTT CGGCCGGGCC
ACCTGGCGGT GGGCCGCCAG CCGCTACACC AGGGCGGACC CGGCCGACCG TGGCTCCAGC
CGATGGAGCC TCCTGCCCGT TCTCATCTCC TCCCACGCCC CGGAGACGGC ACCGTTCGCC
GAGATCCAAC CGGTGAGCCT GGTCAAACTC GACGAGGAGA CAGCCAGAAC GACGCTGCTG
GCGGAGCTGG CCGACACGTG GGACGAGCGC GAGTCCGCGG TCCTGACGCC CGTGGAGTTT
CCCGGGAGCC CGGAGGATCT CGTCCGCCGT TACCAGGAGC GGGCTGACGA GCTCGAAAGG
GCGAACGCTG TGACGGCGGA ACGGGTGGCG ACCCTGCGCC AGCTCGCCCA GGAGAAGCTG
GAGCATCACG GTACGACCTC GACCTCCCAG GCCGTCCAGC ACCTGAGGGA GGCGAACCGG
CTCGCCCGTG ACAGGCGGGA CAGCCTCGTG GCCGCTCTCA CCGGGTTCGA GCTCGCCTAC
GCCGAGTTTT CGGAAGAGCA ACTACCACAG CGGCGGCGTA CCGGCCGGCC CCTACAGACC
GCCGTGCGCG CCGTCGAGGA GACCATGCGG CTGGTGGACT CCGACACCAT CGCCGATCTC
TCCGACGCCA CCGACATCCG TGGCGCCATT GACCCCAACA ATTCCCGCGG CGGAATCCAC
GCCAGGAGCA CCCTGACGTC TTCCAGGCTC TACCGCAATG ATCTGCGCCC CCTTCGCCGG
GCCATAGACG AGGTGGACCG GCTTGGGCTG ATCGAACCCA TGTCCCCGTA CGGGCAGCGG
CTGCGGCTGG CGAAGGCGGT TGTGCTCCGT CTCCTCGACG ACGTCCAGGA GGCACGGCCA
CTCCTGGTCG AGGTCCACCA GAACGCCGGC CGGGACCGCG CTCTCCGCGC CCGGTGCGAC
CTGGAGATCG GGATGGGGTC GTTCGCACGC GGCAGAACGT TCTGGGACGA CGCGTACCGC
AAACTGCTAC GGCTGGTCCA GCAGCCAGAT GCCGAAGCCA GGATCAGGTT CCTGGCGCTT
TCCACCCTAG GCGCGATGTC GAAGCCCGAA GAGGCCAAGG GGCATTTCGA GCGGGCGCTC
GAGGCGGTAG CTGGGGAACG CCACCTCGCC GGACTCCAGA TCGCCGCCCT CGTGAATCTC
GGCCGGAACG CGGCTGCCCG CGGCCTGCAT CCCACGGAGG AGGGCGGACC GCTGAAGCAC
TTCGAGGAGG CGGTCGACGC GGCCAGGGGC GAGTGGCCAC TCGTCTCGCT TCCGCTCGCC
GAGGCATACT GGCAGACCAG CCGTCCGGTG AAATCACCTC ATACATCGAA GGAAATTCTG
GCCCGTCTGC TGCGAGCGAT CTGGATCTAC ACTGTTCTCG GCCTGCAGGA CGAGGTCCAT
GAATGCCGAA AGATCATCAG GAGTCGCGCG GCGAAGAACA TGGGCTGGGA TTCCTTCAAG
GAAATAAACG ACGGCGAACC GACCTCCACG TTCACCCCCG CGAAGATCAG CGACTTCGAC
GTGATCATGG CGATCCTCGA CGTCCGAGTA CCCGACACCA CCGCGCTCCG CTGA
 
Protein sequence
MPGTVVTFYS YKGGVGRSLA LAWVAWILAS AGKKVLVVDW DLEAPGLENY FWPAAAVKGM 
RTRPGLVDLV IDYRDRLDAD ALARYLERHS WQPLAAHPDW SGEDWADPST AEQARAELAD
PDLREWVRER ITRVDLDAVR AELTRDSPGG EADSDPMKQR LRDCTSWASF VEELDIDDVF
QPGGSVDLMR SGRLDAEYAA KLALLDWNEL YTKFAGHRFF RTLKSRWTDY YDFVLVDSRT
GMGDVSATCT QVLPDVMVAC FGMNEQGILN TARVAAQARA AGARERRELR ILPLPSRIET
SARAAAGKAK DRYQRVFELP WEEFEEVSTT LPRQPLIGPR ILGGTQKYWE QVALPHTPRY
AFQEGPPHGI DRDRLQRACE FLAQNIGRDD TIRLNPPSRT AWAKMVAGFD RPARKLQYDE
FVISYAEGGR DPLWAAWVYE QLCRISARVR QHNASRDGTG FLDDRLRDAG GDPGTDIDGG
KLCVLILLSP RYVESDFGRA TWRWAASRYT RADPADRGSS RWSLLPVLIS SHAPETAPFA
EIQPVSLVKL DEETARTTLL AELADTWDER ESAVLTPVEF PGSPEDLVRR YQERADELER
ANAVTAERVA TLRQLAQEKL EHHGTTSTSQ AVQHLREANR LARDRRDSLV AALTGFELAY
AEFSEEQLPQ RRRTGRPLQT AVRAVEETMR LVDSDTIADL SDATDIRGAI DPNNSRGGIH
ARSTLTSSRL YRNDLRPLRR AIDEVDRLGL IEPMSPYGQR LRLAKAVVLR LLDDVQEARP
LLVEVHQNAG RDRALRARCD LEIGMGSFAR GRTFWDDAYR KLLRLVQQPD AEARIRFLAL
STLGAMSKPE EAKGHFERAL EAVAGERHLA GLQIAALVNL GRNAAARGLH PTEEGGPLKH
FEEAVDAARG EWPLVSLPLA EAYWQTSRPV KSPHTSKEIL ARLLRAIWIY TVLGLQDEVH
ECRKIIRSRA AKNMGWDSFK EINDGEPTST FTPAKISDFD VIMAILDVRV PDTTALR