Gene Franean1_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3520 
Symbol 
ID5671890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4181376 
End bp4182716 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID641242407 
Producthypothetical protein 
Protein accessionYP_001507827 
Protein GI158315319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.80961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACACTG GACCGGGCCC TGAACCGCGA CCCGCTCTTG ACCAGCTCAT CGCGCACATC 
GCCGCACTCA AACAGCGGCG AAACGCGAGC TGGTCGGCCT TGGAACAGCA GACCGGTATC
ACCAGCCAGG CCCTCTCCGC CGCAGCCCAG GGTCACCAGC CCGGCGGCCG GGAGTGGCGA
ATCCCCTCCG ACTCCGTGAT CATCGCCCTG GACCGCGGCC TTCGCGCCGA TGGGAGCCTG
TTCGAGCGCT GGGTACAGGT CAAACGCGAA GACGAGGAAA TCCGCCTGGG ACGCATCGCG
GCGAAAGCGT TTCCCGCAAT GGATGGACTA CCCCCACCGC CAGGGCAGAC AGTGGACAGG
GCAGGGGAGG CGAGAACGAC GGACAGAAGA CTTTTCGGTG TCGGAGGCCT CAGCGTGGCG
GCGGTGCTGG CATTCGCTGG CGACGTGGAC GACCAGCTCC AGTCACACCG ACCCAAGATC
ACGGACGTGG CGGACGCGGA GACCGCGCTC GCCAATCTCG AACGAGACCG CGACGCCGCG
GACCCGGCGG ATCTGTTCCC GCCCGCCTAC GAAGCCTGGA CGGCGGTTGA GGGGATCCTG
CCGAGGCGAG TCCATCCCGC GTACGTCCCG AAATTGACCC TGCTGGCAGG GACTCTCGCG
GCCGGCCTGT CGACGGTCGC CTCGTTCGCC GGGCACGAGC GGTTCGGCCG GGTCTTCGCC
GGCATCGCCG AAGTACACGC CAACGCCGCC GGAGAACCAG CACTTCGGGC CCGCGTCGCC
GGAATCCAGT CATGGCAGGC GCTCGACGCG GGTCTCGCGC TCGACGCCGC CGACATCGCC
GCGCGGGGAC GCCAGCACGC GGATCCGGCA GACCGGGCTC GCCTCGCCGC CTACGAGGCG
GAGGCCGCCG CCGCAGGCCT GTACAGCCGC GCCGACGAGG CAGTCGCGGC GATGCGAACC
AGCATGCGCG CCGCCGCTAC CGGCCGGCCC GCGATCGCCT GGGGCGACGC CAACGAACAG
CTCTTCACCG CGCTCACCGC GGCACAGACA CCCGGCCGCG CCACAGTGGC GATCACCCTC
GGCACCAGAG CCGCCGAATC GTTCGACCGT CCGTGCCAGG GCATGGCTCT CGCTCATCTG
GCTGTCGCCA CTGGGTACGT CCGTAAGGAT CGTCCGGCCC CGGACGTGGC CGCCGCCTCA
GCCGTCGCCG CACTCGACAT CGTCGAACAC GCCCCGAACG CAGAGGTCCA TGACCGTGCT
CGCCGGCTCG CCACCGAGCT GTCCGCGTGG AGCTCCGAGT CACTGGTGCA GGAACTCGAC
CAGCGGACCG CCGCCCTCTG A
 
Protein sequence
MHTGPGPEPR PALDQLIAHI AALKQRRNAS WSALEQQTGI TSQALSAAAQ GHQPGGREWR 
IPSDSVIIAL DRGLRADGSL FERWVQVKRE DEEIRLGRIA AKAFPAMDGL PPPPGQTVDR
AGEARTTDRR LFGVGGLSVA AVLAFAGDVD DQLQSHRPKI TDVADAETAL ANLERDRDAA
DPADLFPPAY EAWTAVEGIL PRRVHPAYVP KLTLLAGTLA AGLSTVASFA GHERFGRVFA
GIAEVHANAA GEPALRARVA GIQSWQALDA GLALDAADIA ARGRQHADPA DRARLAAYEA
EAAAAGLYSR ADEAVAAMRT SMRAAATGRP AIAWGDANEQ LFTALTAAQT PGRATVAITL
GTRAAESFDR PCQGMALAHL AVATGYVRKD RPAPDVAAAS AVAALDIVEH APNAEVHDRA
RRLATELSAW SSESLVQELD QRTAAL