Gene Franean1_3896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3896 
Symbol 
ID5672257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4660545 
End bp4661762 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content75% 
IMG OID641242775 
Producthypothetical protein 
Protein accessionYP_001508192 
Protein GI158315684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.103005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCACG GTGGTGACGG GACGGGCCAG CACGCGGACG AGGGCGACAT CCCCGCCCAC 
CGCTCACCGT GGGCACGGCC CGCATCGGGC GAGTGGCCCG CATCGGGCGA GTGGCCCGCG
CCGGGTGGCG ACCAGCCGAC GATGGACCTG CACCACCGCG CCGGCGCGGC GGCCGCCACC
GGCCCGCGCT CGGGGCAGCC GGCCCCCCCG CCCCCCGCCG GGGCGTTCCA ATCGGGCCGG
CCGGCGTGGC CGGCGGCCGG CGGAACGGGG GGCACGGACC GGTGGGAGGA GAACCACCAG
CCCGACGAGC CCGACGAGCC CGGCCGCCCG CCGATCCCTC CGACGATGGC CTACGGATCC
GCCCCGGACG ATCACCGGCT GCTGGCCGGG ACGACCCCGT CGGCCGGTAC GACTCCGGCG
CGCCGGCGCC GGCGCCGGCG CCGGTGGCTG TGGGCCCTGG TCGCGGCGGC CGTCGGCTTC
GTGCTGCTGG TGGTGGGTGA CCGGGCCGCG GTCTCGGTCG CCGAGGGCCA GATGGCCAAG
CAGATCAAGG TCAGTGTGGC GGAGAGCCTG GAATGCGGTG CCCCCGCGCC GACCGTGCGC
GACGTGCACA TCGGCGGCTT CCCCTTCCTC ACCCAGATCC TGCTCGGCAA GTTCAAGGAG
ATCGGGGTGA CGATCGAGGG AATCGCCACC CCGGGGCCCC GGATCTCCGC CGTGCAGGCG
CAGCTGTCCG GCATCCACGT GCCGCTCGGG GACATGATCT CCGGGTCGGT CGGCGCGGTG
CCTGTTGACG ACATCCGGGC GACTGTCCGG CTTGACTACG CCGACCTCAA CACCTACCTG
GCCGGGCTGC CCGGCGCCCT CCAGGTGAAC CCGGTCGACG GCGGGCGGCG GGTCGAGATC
TCCGGGCGCA CCGACCTGTG GCTGTTCGGC TCACAGGAGA TCGGGGGCGT CACCACCTTC
GAGGTCCGTG ACAACGTGCT CACGCTCGTC CCCAGTGAGG TGACGCTGCG CGGGGCCATC
AACGCCACGA TCCCCGTGCC CGTGGGCGGC CTGCTGCCCC CGATCAAGAT CCCGGTCGGC
CAGCTGCCGC TGGATCTCGA CATCGTCGAG GCGTCGACGG GCGGGTCCGG GCTGTCGCTG
ACCGCCGCCG CCCACGACGT CGTCCTGCCC GCGGCGGAGC AGCCCGCGCC GCGTCAGTGC
CCGCCCGGCA ACACCTGA
 
Protein sequence
MNHGGDGTGQ HADEGDIPAH RSPWARPASG EWPASGEWPA PGGDQPTMDL HHRAGAAAAT 
GPRSGQPAPP PPAGAFQSGR PAWPAAGGTG GTDRWEENHQ PDEPDEPGRP PIPPTMAYGS
APDDHRLLAG TTPSAGTTPA RRRRRRRRWL WALVAAAVGF VLLVVGDRAA VSVAEGQMAK
QIKVSVAESL ECGAPAPTVR DVHIGGFPFL TQILLGKFKE IGVTIEGIAT PGPRISAVQA
QLSGIHVPLG DMISGSVGAV PVDDIRATVR LDYADLNTYL AGLPGALQVN PVDGGRRVEI
SGRTDLWLFG SQEIGGVTTF EVRDNVLTLV PSEVTLRGAI NATIPVPVGG LLPPIKIPVG
QLPLDLDIVE ASTGGSGLSL TAAAHDVVLP AAEQPAPRQC PPGNT