Gene Franean1_3815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3815 
Symbol 
ID5672179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4531225 
End bp4533678 
Gene Length2454 bp 
Protein Length817 aa 
Translation table11 
GC content72% 
IMG OID641242694 
Producthypothetical protein 
Protein accessionYP_001508114 
Protein GI158315606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.888105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.726246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGTCGG AGCCATCGGG TGCGAGACAA TCGCGATCCG GGGAACAACG CACCCGGGCG 
GCGGACCCGC TGCGCACCAA GCCGGCGCGC TCGCCTCACC CCAGCGCGGA CCGGCCCGGC
AACCTGCTGG CCCTGCAGAC GCTCGCCGGC AACGCGGCGG TCACCGACCT GATCGAAGCG
GCCGGACCGC CCGGTGTGGC GCGCTGGACC GGACCGATCA GCTTTCAGTC CCAGGCCTCG
TTACTGGACG AGGCGCGCAA GGGCAGCTAC AACGCGGTCA TTCAGCTGGA CGAGGCGACC
TTGGCCGGCG CGGACGACAA CGATCGCCTC AAGTGGATTG ATCAGGTCAA CGACAGCACT
CTGGTCGTCC TGCGCGCCTC CCGGGCGCTG GAACGGATCT GGCGAAGCTT CGGCAGCCGT
TTTCTCGAGG TCGCCGGGGC GAACCCGGAC AACCTGGCCC GCTGGCGGCG CAGCTGCGCG
CGACACACCG GGCTGCCCGA GCAGGTGCCG CAGGCCGCGG ACCTGCAGGG CGCCTGGTTG
CGGGACATTC GCACGATCGG CGGCGGCTGT CTGGACACCA ATGAGGAGTT CGCCAGGAAC
AAGCTCCAGC AGTTCGGCGC GTCCGAGTCG GGCGACACCA TGGCGGCGCC AACCGACGAG
CAGGCAAACG CGCTGAGCCA ACTGCAGACG GCCGCCGAGG GGCTGGCCGC GCTGCGTTGG
GGACAGGAGA CCGCGCGGCA GATGTACGTC GGGTATGTCG ACTACCTGCC GCCGGCCGGT
TCGATGCAGG ACGCCCACCA CTACCGCCGG GTCCGGTTCG ACCCGACCGC GGCGCCGCCG
CTGACGAGGA TCGAACGTGA CCGGGAGCGG CCGGCCTACC TCTATCGCGA GCTCACCGAG
ACCGAGGAGC TGGTCTCGGA CGATGCCGAC CCGAATGCCC TCGCCCCCGT CCAGTCCTAC
GAGGATGCGC GCCGGAAATA CGACGAGGCG GAAGCCGCCG CGAGCGTGAC CCTGTCGATC
TATCCGGAGC TGTTCGCGTT CTCCGGCAGC CAGTCCGATG CCGGGCTGGG CCAGTTCGCG
GTCGCCCAAA GCAGCTCCGC GGCCCGGCAA CAGTTGGTGA CCGGGCTACG GACCATGCTG
AGTCACATCC GCGCCACCAG ACAGCAGCTG GGGCCCGGCG GTGGCCTCGA CCCGCTGGAC
CTCACCCCCA TCCACCGGCG ACTGCTCCGC GGCGAGATCA CGGCGTCTTC GGGCACCGAC
TGGACCCGGC CCTTCGCGCG CGAGGTGGCG GGAAACCTCG TCCAGGGCCA CAACGTCGAC
ATCGCCCTGC ACCGGCTGGG GCTCCAGCTG ATTGCCGAGG CCGCGTTCCT CTTCGCCCCG
GCGACGGGTG GGCTGACCGC CGTCGCCGCG CTGACCCTCG CCACCGGTGC CTCCGCCGGG
AACGTCGCTC TGGACGCCAG CCGGTACGCC GCCCTCGCCG ACGCGGCGGC ATCGGCGGCC
CGCCCGGGTA CAGCACTGGT CGACCGCCGG ACCGTCGACG ACGCCCGGAT GGCCACCGAG
TCGGAGGCGA TCGCGCTCGC CCTCGCAGCC CTCGCCCTGG GCGCCGCCGC GGCCGCCGGT
GCCCTGCGCG CCTGGCGTGC CCGGCAGACG CCGCCACCAG AACAGCCGCC CCCCGCCCAG
CCGCCGAGCG GGGGGCGGCC GGCCCAACAG GGCAACGCAC CCCAGCAGGG CGGACAACAA
GGCGCGCCCC AGCCGGGGAC GCCACAAGCG GGCGCGACTG AGCAGGGCGC GCCGCAGCAG
GGGAACGCGC CCCAGCAGCC GCCGGACCCC GCTGCGGCCG TGGTGGCCCA GGCACAGGCG
CAGACACAGG CCGCCCGCGC CGCCTTCGCA GCCGAGATCG GAATCGACGC GGGGACGCTG
GCCGGCTTCA CCGAGGAGGA GATCAACCGG CTGCGTCAGC TGCTTCCCAA CAGGCACCCC
AGCCGAATAG CGGGGCTGCG CAACTACCTC AGCGAGCAGG TGAGCCGGGG CCGACACACC
AGGAACATCC TGCGCACCCT GGAGGAGATG GAGCCACGGG AGCGGGCGCG CTACCTCGAC
CGGCGAGCCG CCATCCGATG GAATCCCGAC TGGCGTGGCC GTGACCCGGC GCCCCGGCTG
GAAGTCGGCA ACGCGGACGA GGGGTGGACG CACATCGATG CGCGGCATGT CACCGGCAAC
GCTGCCGGCG GAGCCGGTGA CCTGTTCGCG CCGGGGACGA CACGGCAACA GATCTTCGAG
GCGGCGGTCG AGGTCATCGA GCGCGGAAAC CGCGTCTCGG CCCGCGGCCA GCGGATCACG
ACGTTCGAAC GGTCGTTGTA CGTCAACGGG CGGCGGGACG CGATCCGGGT GACGGTGGAC
ACCTCGGACG GTCGTATTAT CACCGTCTTT CCCGTCCGCG GAGGTGGGCC GTGA
 
Protein sequence
MLSEPSGARQ SRSGEQRTRA ADPLRTKPAR SPHPSADRPG NLLALQTLAG NAAVTDLIEA 
AGPPGVARWT GPISFQSQAS LLDEARKGSY NAVIQLDEAT LAGADDNDRL KWIDQVNDST
LVVLRASRAL ERIWRSFGSR FLEVAGANPD NLARWRRSCA RHTGLPEQVP QAADLQGAWL
RDIRTIGGGC LDTNEEFARN KLQQFGASES GDTMAAPTDE QANALSQLQT AAEGLAALRW
GQETARQMYV GYVDYLPPAG SMQDAHHYRR VRFDPTAAPP LTRIERDRER PAYLYRELTE
TEELVSDDAD PNALAPVQSY EDARRKYDEA EAAASVTLSI YPELFAFSGS QSDAGLGQFA
VAQSSSAARQ QLVTGLRTML SHIRATRQQL GPGGGLDPLD LTPIHRRLLR GEITASSGTD
WTRPFAREVA GNLVQGHNVD IALHRLGLQL IAEAAFLFAP ATGGLTAVAA LTLATGASAG
NVALDASRYA ALADAAASAA RPGTALVDRR TVDDARMATE SEAIALALAA LALGAAAAAG
ALRAWRARQT PPPEQPPPAQ PPSGGRPAQQ GNAPQQGGQQ GAPQPGTPQA GATEQGAPQQ
GNAPQQPPDP AAAVVAQAQA QTQAARAAFA AEIGIDAGTL AGFTEEEINR LRQLLPNRHP
SRIAGLRNYL SEQVSRGRHT RNILRTLEEM EPRERARYLD RRAAIRWNPD WRGRDPAPRL
EVGNADEGWT HIDARHVTGN AAGGAGDLFA PGTTRQQIFE AAVEVIERGN RVSARGQRIT
TFERSLYVNG RRDAIRVTVD TSDGRIITVF PVRGGGP