Gene Franean1_0341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0341 
Symbol 
ID5668765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp409243 
End bp410526 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content70% 
IMG OID641239273 
Producthypothetical protein 
Protein accessionYP_001504713 
Protein GI158312205 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.156991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00506023 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGTCTC AGGTGAGCAT CGATCTCAGG CCGGGCGTAC CGAAGGAAGG CGGGACCCGT 
GCGGTGGCGG CGAAGCCGAG GAGGCGGCGG CGGAAGGATC GTACCGGGTT CACGGCGGTG
GCCCGGGCCG GTCGGCGCCG CGCCGCGGGA TTTCTGCTGG CGGTCGTTGT TCTAGTCGGC
GCCAGTGCCT GCCAGGCCGA ATACAAGCCA CCGTTCGCAC CCATCGTGTT CGCCATCGAC
GAGAACGGCA ACATCGACGT CAGCGCGAGT CGTGACCTGG TGACCCCGAT CGGCACCTTC
ACGATCAGCG AATCGGTTGC ATTGCCGCGG GACGTCCCAT CGGACCGAAC CCTGATGATC
CTTCGTCACA AGGTCGCCGG TGAGCTGAAG GATGCTTGGT TCACGCTACT GGCCGCGGTG
AACCTGCACT TCTCCGTGGA CGGCGCGAAC CGACTCGTCC CGCAGGCCGA GGAGAACGTC
GCGCTGTTGG AGGTGACCGG CCCGGCGACC GGGGTGGTCG CCGAAGCGAC AGACGACGAC
TCCGGTGACC GTTTCGAGTC CGAGCAGGTC GAGGTACTGC CGGAGGTTCC CGAGGACACG
GACCCCTCGG CCACACCGGA CGACGGCGGG CCGAGCGGAG GGCCGAGCGG CGGGTCGGGT
GAGCCGGGCA TCGAGGTGAC ACCGGAGACG CTGAGTTGCG ATGACTCGGG GTGCGCCGGG
ACCGTGACCG TGGAGAGCAC CGGCACCGGC ACGCTGCGGG TCACCTCGAC CGAGATCATC
GGTCCTGACG CCGACGCGTT CTCCGTGGAC GCGGGCTGCG AGGCCGAGCT GCCTCCCGGC
GGGCAGTGCA CTCTCAGCGT CGGCTACGTG CCGCTGGACA GCGGCGAGGC GGCCGCCGCG
ACGCTGGTCA TCCACCAGAA TCTCAGCGGT CCCGCCAACG AGGTGAACCT GGAGGGGAGC
GCCGGGACGA CGCCGCCCGG CCCGGAGCCC GGCATCGCCG TGACGCCGGA GACGGTGTTC
TGCACCAGCT CCGCCTGCCA GCCGGTGACG GTCGAGAGCA CCGGCGACGA CCCGCTGGCC
GTCACGTCCG TCGAGATCGT GGGTTCAGGC GCCGCCGCGT TCAGCTACAC GAGCGACTGC
GAGGGCGCGT CCCTGCCGAC AGGCGCCCGG TGCGTCGTCA CCCCCGAGTA CACGCCGCAG
GGCGGCTCCG ATGCGACGGC CACCCTGGTC ATCCACCACA ATCTCGCCGG GCCCGCGACC
GAGGTCACCC TCTCTGCCTC CTGA
 
Protein sequence
MTSQVSIDLR PGVPKEGGTR AVAAKPRRRR RKDRTGFTAV ARAGRRRAAG FLLAVVVLVG 
ASACQAEYKP PFAPIVFAID ENGNIDVSAS RDLVTPIGTF TISESVALPR DVPSDRTLMI
LRHKVAGELK DAWFTLLAAV NLHFSVDGAN RLVPQAEENV ALLEVTGPAT GVVAEATDDD
SGDRFESEQV EVLPEVPEDT DPSATPDDGG PSGGPSGGSG EPGIEVTPET LSCDDSGCAG
TVTVESTGTG TLRVTSTEII GPDADAFSVD AGCEAELPPG GQCTLSVGYV PLDSGEAAAA
TLVIHQNLSG PANEVNLEGS AGTTPPGPEP GIAVTPETVF CTSSACQPVT VESTGDDPLA
VTSVEIVGSG AAAFSYTSDC EGASLPTGAR CVVTPEYTPQ GGSDATATLV IHHNLAGPAT
EVTLSAS