Gene Franean1_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1953 
Symbol 
ID5670354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2348991 
End bp2350037 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID641240874 
Producthypothetical protein 
Protein accessionYP_001506296 
Protein GI158313788 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.098143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.528904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG TCCTCGCGCT GGCCGTAACC GTGCTGCTGC TCGCCGGCAA CGCCTTCTTC 
GTCGGGGCCG AGTTCGCGAT CATCTCCGCC CGCCGGGACA CCATCGAGCC GATGGCGCTC
GCCGGTTCCC GGGCGGCGAA GGTGACCCTG AAGGCGATGG AGAACGTGTC GCTGATGCTG
GCCGGAGCCC AGCTCGGCAT CACGGTCTGC ACGCTCGGTC TGGGCGCGCT GAGCGAGCCG
GCCATCGCGC ACCTGTTGGA GGGGCCGTTC GAGGCCGTGG GCCTGCCGCT GTCGCTGCGC
CATCCGGTGG CGTTCGCGAT CGCGCTCGCC GCCGTCACCT ACCTGCACGT GGTGATCGGT
GAGATGGTCC CGAAGAACAT CGCGCTGGCC ATGCCGGACC GGGCGGTCCT GCTGATGGCC
CCGCCGCTGG TCGCGGTCGT CCGGGTGGTG AAGCCGGTGA TCTCGATCCT CAACCGGATC
GCGAACCTCT CCCTGCGGGC GGCTCGGGTC GAGCCCAAGG ATGAGGTGAC CAACGTCTAC
ACGCGCGACG AGGTGGCCGG GCTCATCGAG GAATCACACC GCGAGGGCCT GCTGGCGGAG
GACGAGCACG ACCTGCTGAC CGGCGCGCTG TCGTTCGACG AGCGCACCGC GCGCAGCGTC
CTGCTCCGCC CGGACAGCCT GGTCACCGTG CCGCCGTCCA TCACGCCCCG CGAGGTCGAG
CGGCTCGCGG CCGACACGGG CTTCACCCGG TTCCCGGTCC GCGGGGACGA CGGTGACCTC
GCCGGCTACC TGCACCTCAA GGATGTCCTG GAGAACCGCG AGGACCGACG TTCGGCCCCG
GTGGCGGCCA AGTGGATCCG GCCGCTGGTC CGCGTCGGGG CGGACGACAG CCTGCGCACG
GCGCTGGCCA CCATGCAGCA CTCGGGATCG CACCTGGCCC GGCTCTCCGA CGGCGAGGGC
CGGATCCTCG GCCTGGTGGC GCTGGAGGAC ATCCTCGAGG AGCTGGTGGG CGAGATCCGC
GACGAGGCGA CCCGTCAGCG CGCCTGA
 
Protein sequence
MNDVLALAVT VLLLAGNAFF VGAEFAIISA RRDTIEPMAL AGSRAAKVTL KAMENVSLML 
AGAQLGITVC TLGLGALSEP AIAHLLEGPF EAVGLPLSLR HPVAFAIALA AVTYLHVVIG
EMVPKNIALA MPDRAVLLMA PPLVAVVRVV KPVISILNRI ANLSLRAARV EPKDEVTNVY
TRDEVAGLIE ESHREGLLAE DEHDLLTGAL SFDERTARSV LLRPDSLVTV PPSITPREVE
RLAADTGFTR FPVRGDDGDL AGYLHLKDVL ENREDRRSAP VAAKWIRPLV RVGADDSLRT
ALATMQHSGS HLARLSDGEG RILGLVALED ILEELVGEIR DEATRQRA