Gene Franean1_6156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6156 
Symbol 
ID5674477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7489894 
End bp7491201 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content75% 
IMG OID641245008 
Producthypothetical protein 
Protein accessionYP_001510406 
Protein GI158317898 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.665559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACC GCAGCGACGG CGACCGCGAT GACAGCGGAC GCCACGGCAC CGGACCTGGT 
GGGCCCGCCC ACCACCATGA CGGGGACGGA CCTTCCGGCG GGCGGGACGA ACTCGGCGAC
GACTGGGGCC GGGTGGATGC CGACGGCACC GTGTACCTGC GCACCGCCGA CGGTGAGCGC
GCGGTCGGCT CGTGGCGCGC GGGCAGCCCT GAGGAGGGCC TGGCCCACTT CCGCCGTCGC
TACGACGACC TCCTCGCCGA GGTGGTGTTG TTGGAGCGGC GGCTGACGGT GAGCGGCGTC
GACCCCGGCG GCATCGCCGG CAGCGCCCGC CGGCTGCGGG AGGGGCTGGC CCAGGCCTCG
GTCGTCGGCG ACGTCGACGC CCTGGCCGCG CGGCTCGACG CCGTCCTGGC GGCCACGGAC
ACCCGCCGCA CGGAGCTCGC CGCCGAGCGG GCCCGCCGGG TCGCCGCCGC GGTCACCGCC
AAGGAGGAGC TGGTCACCGA GGCCGAGCAG CTCGCCCGCA GCTCCGAGTG GAAGGTGACG
AGCGAGCGTT TCCGGACCAT CGGCGACGAT TTCCGCGCGA TCACCGGTGT CGACAAGCGG
ACCGACTCGG CGCTGTGGCG GCGGATCGCC GCGGCCCGCG ACGAGTTCAC CCGCCGCCGC
ACCTCGCACT TCGCCGCGCT CGACACCCAG CGCACGCGCT CACGCGAGCG CAAGGAGGCG
ATCATCGCCG AGGCCGTCGC GCTGGCGGAC TCGACGGACT GGGGCCCGAC GACCGCGCGG
TACCGCGCGC TGATGGTCGA GTGGAAGGCG GCCGGCCGGG CCGCCAAGGA CGTCGACGAC
GAGCTGTGGG CCCGGTTCCG GGCCGCGCAG GACGGCTTCT TCAGCCGGCG CAACGCCGTG
AACGCCGAGC GCGACGCGGA GCAGATCGCC AACCAGGCCC GCAAGGAAGA GCTGCTCGTC
GAGGCCGCCG CGCTCGATCC CGTCGACGTC GAGCGGTCGC TGCGCCGGTA CCGCGAGATC
CAGGAGCGCT GGGACGCGAT CGGCCGGGTG CCCCGCGAGG CGGTCGGCAG CCTGGAACGC
CAGCTCAACG CCATCGGGGA CAAGCTGCGC GATGCCTCCG ACGCCCGTTG GGACCGTCGT
GACATCGCCG AGTCCCCGTT CCTGACGAAG CTGCGCGAGT CGGTGGCGAA GCTCGAGGCG
AAGCTGGAGC GCGCCCGCGC CGCCGGCCGG GCCCGCGAGA TCACCGAGAC CGAGAACGCC
CTCACGACCC AGCGCGCCTG GCTGGCGCAG GCCGAGAAGG GAAGCTGA
 
Protein sequence
MSDRSDGDRD DSGRHGTGPG GPAHHHDGDG PSGGRDELGD DWGRVDADGT VYLRTADGER 
AVGSWRAGSP EEGLAHFRRR YDDLLAEVVL LERRLTVSGV DPGGIAGSAR RLREGLAQAS
VVGDVDALAA RLDAVLAATD TRRTELAAER ARRVAAAVTA KEELVTEAEQ LARSSEWKVT
SERFRTIGDD FRAITGVDKR TDSALWRRIA AARDEFTRRR TSHFAALDTQ RTRSRERKEA
IIAEAVALAD STDWGPTTAR YRALMVEWKA AGRAAKDVDD ELWARFRAAQ DGFFSRRNAV
NAERDAEQIA NQARKEELLV EAAALDPVDV ERSLRRYREI QERWDAIGRV PREAVGSLER
QLNAIGDKLR DASDARWDRR DIAESPFLTK LRESVAKLEA KLERARAAGR AREITETENA
LTTQRAWLAQ AEKGS