Gene Franean1_2818 details

Gene Information

Locus tag: Franean1_2818
Symbol:
ID: 5671207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3334629
End bp: 3336782
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 61%
IMG OID: 641241727
Product: hypothetical protein
Protein accession: YP_001507147
Protein GI: 158314639
COG category:
COG ID:
TIGRFAM ID:
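
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3334629
end_bp = 3336782


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2154
print(protein_length(length_bp))  # 717
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.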


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.245022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGAA GCTCACCCGA TGTCGAGCCA GACAGGCGGA TCTGCCTCCC GGACTCGACC 
TTCGAGTTCA GCTCCGCTGC TGCGGCCGGT GCCGAGACCC TTCTCACCGG ACCTCCGCAG
GCGCTCGTAC ACGCGCTGTT GGATGACCAT GAGTCGGAAT TTGTTTTGCT GGCCGCGCGT
CGCATACTGT CCGCTCTTCT TGATGATTCA TTGGGCGAGC ACAGCGAACT CGGCTTGGAC
GCAGTCGTCG AGATGGTGGC TACATTACGT CGTGAACTCG ATGCGAAGAT TGTCGGTATT
TCCGGCGGTA ATTCGTCCGC ACGGGTCGTA ACGCTACTCG AGCGTGCGCC ATTAGCGCTG
CTAGCCGGCT GCTGGCTGGA CACGGTCTCT CAGCCCGCGA CGCAGCCGGC GCTGATTGTC
AATCGCCTGT TAGAGGACTG CGTAAGGCTT CGAGGTGGCG GTCATCCGAG GAAGGCCCTA
ACGCATCTGC GACGCACCGC TCTAGAGTCC CAGGGCGTTC ACCTGCCTGT GCTTGGCGCC
GAGGATTTTA TCTCCGCATC CCAGGCGCAC CCACTGACGG TGCGGCAGGC CATTTTCTAC
CTTGCGTTGT CTCGCTTTCC CGCGACTTTT CTGCCCGAAG TCGTCGGCGT GCACTACGCG
GTGTTTTCTC TAGGGGTGGA CGATCGCCTC TTGCGGATGC CACCAGCGTT GTCGGAAGCA
TCGTTGCGTG CCGTTCTGGC CGAATATTTC ACTCTGGCGG ACGCGTCCGT CGACGGAGCC
CGGGTTCGCC AACGACTGGC CGCAGCGGTC GGACTCGTGC TTCGTCTGGA GCGTGAGCAA
GTTGATCTGC TATCGGAACT CGCCACGCGG AACGTCGATC AGTCACCGCA CAGCGAAGTC
ACCGAGATCG TGCGTCGACA CCTGCCGTTC GCTGGTGAGC ATCATCGCGA TGTCGTCGTC
GGTGGTCGGT CGTTGGCGGA CGTGGCTGCT TCCGATGACG CCGAGGTCGC CGAACTCATA
CGCGAACTGC ATGCGTCCCC GTACGTCCGA TCGCCAACTC ACAGTGGGCC TCGGATCGTC
AGGGCCATCA AGTTCGGTGG TCCAATGTTC GGCATCTTCG ATGAGGCCGA GGCAGCCGCC
TTGACAGCGT GGGCGAGGGA ACCCGATGTC GCCGCTGCGG ATTCGGCGGC TCAGGTGCTC
TACGGGTATG AGGTCTCCCC AAACCACGCG ATTTCCATTG GTGACGCCAT GCCGGCTGAC
GTGGTCTGGG CTGACCGGGC ACCCGACCAC GATCGCCAGC TATTCCACCG CCTAGTGAAC
GTTGAGAACT TTCCAAACAT CCGCCCGGTG GCGAGAGAGC GGGCTGCCCA GGTTCTCCAG
GCGGCGGAGG TGTTATTCGA GCACGGCTCG TCTGGCAGGT ACACGGACGC CAGCTTCTTT
CCCTACGAGT CCGGGGCTCT GCGCGAGCGC ATCGAGAGAA TTTACTTCGA TAAACGCCTT
AGACGGTCTG AGGCGACGAC GGAGTTACCA TCCCGGGGCG CCGTCGTATC GAGCCGCAAG
GAGCGCCTGA TCACCAATAT GGTGGACGGC TGCTGGCTTT ACAGAATTGG CGCAACAGGC
CGGTACGGCA GGGACAGCGA CGGACAGCTG TTCGCCATCT ACGCCGATGA GATGGGCGGC
GGGGACATCC GCAAGAATCA CATCATGCTC ATCCACGGGG CGCTCGCAGA CATGCGCATC
TCGGTGCCAC ACATTAGTAA TGTCGACTTT CTTTCCCAGT GCGAGCTTCC GGACAGTTCC
TACGCTCCTG CGATCTACCA GATCTGCCTG GCCTTGTTTC CTGACAGCTA TTACCCTGAG
ATTCTCGGCT ACAACCTTGG CATGGAGATG GGAGGGATCG GCGAGCTTGG TATAAGCGAG
ATCCGGCGAT TGCGCCATTA CGGATTCGAC GCCACGTATG AGGCGACCCA TCTGTCCATC
GACAATATTT CCAGTGGTCA CTCCCGGCAG GCTGCGGACA TCATCGTCCG CTACCTTGAC
GACGTACGCC GGGAATCGGG CGATGCCGCC GTTGCCGCAC GGTGGCGCCG TGTCTGGCGC
GGCTATGCTT CGTTCGCCTA TTTCGCCGAG CGAGATTTGG TCCAGGACCT ATGA
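
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup and of a simple GC fraction calculation; the helper names are illustrative, and the record does not state how its own GC content value was computed.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGAAGAGAA GCTCACCCGA TGTCGAGCCA GACAGGCGGA TCTGCCTCCC GGACTCGACC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field in the record.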
 
Protein sequence
MKRSSPDVEP DRRICLPDST FEFSSAAAAG AETLLTGPPQ ALVHALLDDH ESEFVLLAAR 
RILSALLDDS LGEHSELGLD AVVEMVATLR RELDAKIVGI SGGNSSARVV TLLERAPLAL
LAGCWLDTVS QPATQPALIV NRLLEDCVRL RGGGHPRKAL THLRRTALES QGVHLPVLGA
EDFISASQAH PLTVRQAIFY LALSRFPATF LPEVVGVHYA VFSLGVDDRL LRMPPALSEA
SLRAVLAEYF TLADASVDGA RVRQRLAAAV GLVLRLEREQ VDLLSELATR NVDQSPHSEV
TEIVRRHLPF AGEHHRDVVV GGRSLADVAA SDDAEVAELI RELHASPYVR SPTHSGPRIV
RAIKFGGPMF GIFDEAEAAA LTAWAREPDV AAADSAAQVL YGYEVSPNHA ISIGDAMPAD
VVWADRAPDH DRQLFHRLVN VENFPNIRPV ARERAAQVLQ AAEVLFEHGS SGRYTDASFF
PYESGALRER IERIYFDKRL RRSEATTELP SRGAVVSSRK ERLITNMVDG CWLYRIGATG
RYGRDSDGQL FAIYADEMGG GDIRKNHIML IHGALADMRI SVPHISNVDF LSQCELPDSS
YAPAIYQICL ALFPDSYYPE ILGYNLGMEM GGIGELGISE IRRLRHYGFD ATYEATHLSI
DNISSGHSRQ AADIIVRYLD DVRRESGDAA VAARWRRVWR GYASFAYFAE RDLVQDL