Gene Franean1_4241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4241 
Symbol 
ID5672596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5049153 
End bp5050283 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content76% 
IMG OID641243114 
Producthypothetical protein 
Protein accessionYP_001508531 
Protein GI158316023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.468194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGACG GACGTCCGGC GCAGGAGGCT CGGCGGGCCG GAGGCCGGGC GGAACGGGCC 
ATCCGGATCA CCACCGTGAT CGCGGTGGCC ACGGTCGCGG CCGTGGCGGG TTTCGTCTCC
TACCGCCACA TGCGCGGCGT CGCCCTGCAA TACGGCGAGG ACGCGATGAC CTCGGCCGTT
CTCCCGTTCA GCGTGGACGG GCTGATCGTG GCCGCGTCGA TGACGATGCT GGCCGACCGG
CGGGCCGGCC GGCGGCGTTC CTGGCTGTCC TACACACTGC TGATGCTGGG GGCGTGCGCG
TCACTGGCGG CGAACGTCCT GCACGCCGAG CCGACCACCG CAGCCCGGAT CATCGCCGGC
TGGCCTCCGC TGGCGCTGCT CGGCTCGTAC GAGCTGCTCA TGCGCCAGAT CCACCCGACC
AGCCGGCGGA CAGCCCGCCA GCAGGCCGCC GCCGCCGCGG CGCCGGCCGA CGTCCCGGCT
GTCGCTCCCC AGGCTGATGG CCCAGCTGCC GGTGGGCAGC CAGGCGGTGG GCAGCCGGTT
CCGGCGCCGG GGTCCATTCC GCCGCAGCGG CAGCGGATGC CCGCCGTCGG CGGCGACTTC
CCGGCTGGCA CCCCCGCTCC CACCGCGACA GCTCCCGCCG CAGCCGACAA CGCTGTGGCC
GACAACGCGG TGGCGGTGGT CGACCTCTCG GTGACCGCAC CCATGGTCAC GCCCGCCCCC
GCGACGGCCC AGGCCCTCGA CCCGGCACCC GGGCGGACGC CTGCGCCGGG CCCGGCCGTC
CCGACAGGTG CCTCGACAAC CGTCCCGACA GGCGGGCCGG CGTCCACGGC CGGGGCCGGC
TCCGCCGGCG GAGTGGACAC CGCGGACTCC TCGGTGAAGC GTGAGGCGAT CATTCGCGCG
CTGGACGAGA CCGGCGGTTC GGCGACCGCG GCGGTCACCC TGCTCGGCCG GTGGGGCATC
ACGGTGAGCA AGAGCTGGGT GTACCAGGTG CGCAAGGAGA CCCGGCACGC CGACGTGCAG
ACCGGGCCGC TGGTCATGCC CACGCACCGT GCCCACCCCG CCAGCCGTGG GCGGCGGCGC
GGCATGGCGC CCGACCGGCC GCTCGTGGCA CCGACGACGA CCAGGGGCTG A
 
Protein sequence
MVDGRPAQEA RRAGGRAERA IRITTVIAVA TVAAVAGFVS YRHMRGVALQ YGEDAMTSAV 
LPFSVDGLIV AASMTMLADR RAGRRRSWLS YTLLMLGACA SLAANVLHAE PTTAARIIAG
WPPLALLGSY ELLMRQIHPT SRRTARQQAA AAAAPADVPA VAPQADGPAA GGQPGGGQPV
PAPGSIPPQR QRMPAVGGDF PAGTPAPTAT APAAADNAVA DNAVAVVDLS VTAPMVTPAP
ATAQALDPAP GRTPAPGPAV PTGASTTVPT GGPASTAGAG SAGGVDTADS SVKREAIIRA
LDETGGSATA AVTLLGRWGI TVSKSWVYQV RKETRHADVQ TGPLVMPTHR AHPASRGRRR
GMAPDRPLVA PTTTRG