Gene Franean1_5836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5836 
Symbol 
ID5674159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7078903 
End bp7080138 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content78% 
IMG OID641244686 
Producthypothetical protein 
Protein accessionYP_001510088 
Protein GI158317580 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.425484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.644094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGCCC TCGGTGTCGC GGCGGTGCTG ATCCTGCTGG GCCCGCGGCT GGCGGACCTG 
CCGTCGAGCC CCGGCGACAT CCCCACGTGG TTCGCCGACG AGCCGGAGCT GGCGTTCGCC
AACTGCCTCG GCATCATCGC CTGGATCTGC CTGCTGTGGC TGTGCGCCGG AGTGGTGCTC
GGCGTCCTGG CCGCGCTGCC GGGCGCGGCC GGCCGGGTGT TCGCCGCGCT CGCCCGCCGG
GTCCTGCCCA GCGCGGTGCG CCGGATCGTC GAAATCGGCC TCGGGGTGAC CCTCGTGGCC
GCGAGCGTCA GCCCCGTTCT CGGCGCCAGC CCGGCTTCGG CCGCCACCGG CCTGCCCGCC
GCCACGGCGT CGGCGAGCGT GTCAGCGGAC GCTTCGGCGG GCGCCCCGGC TATCGGGCCG
ACGGGCGGGC TGGCCGCCTC CCAGCGTGGC TGGCCCTACC TGGGCCACCC CGACGCGGCC
GCCGGCGCCG ACGCGGCGAG GGCGTCCGGC TCCGGTGGTT CCGCCGCCGG TGCCACCCCG
GCGGAGGGCG CGACCGCGGT GCCGCCGGCC GGAAACGTGC CGTCGCCCGG TGCCGGGCGT
GGCCTGCCGG ACGACGTCCA GGTCGTGCCG ATGACCCCGG TCAGCGCGGC CGATCCACCC
ACCCGTGCGC AGCAGCCGCC GGATCCCGCC ACGGCCGCCC CTGACCCCGG CGCCGCGCCC
ACGGACCGAT CGGCCACGGC GACCGCGCGC GGATGGCCCT TCCTCGGCCA TCCCGACCGG
GCCGAGCACA CCGATCAGCA GGGCCAGGCC GGCCAGCCGT CCGGCACCAG CACGCCACGG
CCCACAAACC CGACCCCAGG TGCACCGGCC CCCAGTGCGC CGAGCACCAC CGCTCCGACG
CCCGCCGCCG CAGCGCCCAG TGCTTCCGCG CCTTCCGGCA ACGGCGGGGG TGATCCCCTC
GTGCCCTCCG CGCCGCTCGG CACGGGCACG CCGGGCACGG CGCCCGGCGC ATCCCCGGGC
ACTCCCCCGA CGAATCCGGG CGCGGCCGCG GAGGTCGTCG TGCTGCGCGG AGACAGCCTG
TGGACGATCG TCGCCCGTCA CCTCGGGCCT ACCGCCACGA CCGATCAGAT CGCGGCCGAG
TGGCCGCGCT GGTGGTCGGC GAACGCCGAC GTGATCGGGC CCGATCCGGA TCTTCTTCTT
CCCGGCCAGC GGCTCCTGCC GCCGCCCAGT CCCTGA
 
Protein sequence
MLALGVAAVL ILLGPRLADL PSSPGDIPTW FADEPELAFA NCLGIIAWIC LLWLCAGVVL 
GVLAALPGAA GRVFAALARR VLPSAVRRIV EIGLGVTLVA ASVSPVLGAS PASAATGLPA
ATASASVSAD ASAGAPAIGP TGGLAASQRG WPYLGHPDAA AGADAARASG SGGSAAGATP
AEGATAVPPA GNVPSPGAGR GLPDDVQVVP MTPVSAADPP TRAQQPPDPA TAAPDPGAAP
TDRSATATAR GWPFLGHPDR AEHTDQQGQA GQPSGTSTPR PTNPTPGAPA PSAPSTTAPT
PAAAAPSASA PSGNGGGDPL VPSAPLGTGT PGTAPGASPG TPPTNPGAAA EVVVLRGDSL
WTIVARHLGP TATTDQIAAE WPRWWSANAD VIGPDPDLLL PGQRLLPPPS P