Gene Franean1_7000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7000 
Symbol 
ID5675311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8530266 
End bp8531444 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content69% 
IMG OID641245846 
Producthypothetical protein 
Protein accessionYP_001511237 
Protein GI158318729 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.550341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAA AGCTGGTCTG TCCGTACTGC TACCAGCAGT TCGGGGAACG GGAGATCTGG 
TTCCGGTGCT CCGGGCGGCC AGGCCCCACC GGAAAGTCCT GCTCCAGTCA GCGGGATGAA
CGCCTGGCGA AACGGATGGG ATTCACCGGC CAGCTTCCGC CGGCGTTCTC GGCGGACGGC
CGGAAGCTGT CCGCCGTCCA TTCGGACTGT CGAGCCGAGA CGAACTACCG GTTGTGCCCG
GAGTGCCACA GTCAACTTCC GGTGCACTTC GGAAAGATCG AGAATCGGCT CATCGCGCTG
GTCGGTGCGA AGGAGAGCGG CAAGACCGTT TTCATGACCG TCCTGCTGCA CGAGCTGATG
CACTCGGTCG GAGTCCGGTT CGACGCGTCG GTGCTGGGTG CGGATGACGA GACCCGGGAC
AGCTTCCGCA AGCGGTACGA GGCCCCGCTC TACGACAACC ACCAGCTGGC CGCCCCGACA
CAGCGCTCAA CGACACCGAT GTCACGCCGG CCGCTGGTGT TCACCTTCAC AGCGCGCGGG
CGCGGCCTGG GCCGCCCGCG TCAGGAGCGG ACCGTGCTGT CCTTCTTCGA CACCGCCGGC
GAGGATCTGA ACTCGGCGGA CAGCGTCGAA CAGAACGTGC GCTACCTCGC CAGCGCCGCC
GGCATCATCC TGCTGCTCGA CCCACTGACG ATGCGCGGTG CCCGGGGCCA GGCGGACCCG
GACGCCCCAC GCCCGCACGA ACAGGGCCTG GACAGCCCGG TGAGCGTCCT GGGCCGCATC
ACCGAGCTGC TGCAGCGGGC GTTGGGGACG AAGCCCTCCC AGCTGATCGG CACCCCGATC
GCCGTGGCGT TCTCGAAGAT GGACGCGCTC ACGCGCGGTC TGCCCGAGGA GAGCCCGCTG
CGGCGGTCGC AGCCGGTCGG CTCGCGCTTC GACGCGGCGG ACAGCAGGGA CGTCCACGAC
CATGTGCGCG CGCTGCTCGA CGAGTGGGAG GGGTCGTCCA TCGACCAGAC CCTGCGCCAC
AACTACTCGC GCTACCGGTA TTTCGGGCTG TCGGCACTGG GGGCCGCCCC CACCGCCGAC
CGGCGGGTGG CGACCGGGGT GGTCCAGCCC TACCGGGTGG CCGACCCGTT CCTCTGGCTG
CTGAGCGAAT TCGGCGCCAT TCCCAGAACA AAAGGCTGA
 
Protein sequence
MSRKLVCPYC YQQFGEREIW FRCSGRPGPT GKSCSSQRDE RLAKRMGFTG QLPPAFSADG 
RKLSAVHSDC RAETNYRLCP ECHSQLPVHF GKIENRLIAL VGAKESGKTV FMTVLLHELM
HSVGVRFDAS VLGADDETRD SFRKRYEAPL YDNHQLAAPT QRSTTPMSRR PLVFTFTARG
RGLGRPRQER TVLSFFDTAG EDLNSADSVE QNVRYLASAA GIILLLDPLT MRGARGQADP
DAPRPHEQGL DSPVSVLGRI TELLQRALGT KPSQLIGTPI AVAFSKMDAL TRGLPEESPL
RRSQPVGSRF DAADSRDVHD HVRALLDEWE GSSIDQTLRH NYSRYRYFGL SALGAAPTAD
RRVATGVVQP YRVADPFLWL LSEFGAIPRT KG