Gene Franean1_1387 details

Gene Information

Locus tag: Franean1_1387
Symbol:
ID: 5669795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1678872
End bp: 1680320
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 67%
IMG OID: 641240313
Product: hypothetical protein
Protein accession: YP_001505740
Protein GI: 158313232
COG category:
COG ID:
TIGRFAM ID:
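
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1678872
end_bp = 1680320

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "1449 482"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.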


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGTGC TGGCGCGCAC GCGGCTCTCA CCGAGAGGGG GCCTGCTGTC GCTCACCGAG 
CATCCGCTGC AGCGGGCCGG GGCGTGGGCG ATCGCGGTGT TGGCGGGTCG TGAGCGGCCG
AGCGAGGTGA CGGCTGCGGA TCTTGATGAG GTGGCGGGCC GGATCACGGC GGATGTCGTA
CGCGCTGCCG TCGCTGGGAA GGACTACACA GCGTACGACT GGTGGAAAGT CCTGTTCGCG
ATGTTCCCGA ACTCGGAGCC GACCCATGCC GGCCGTCCCC GGGATCGGGA GGCCCTGATC
GGCACGGTCG CTCGCTTCTT CCTCCCCGAC CTTGAAGGCG AGTCTGCGCG GCCGTGCTGT
TTCTGCGGGG CCGCCGCGAG TGTCCTGTGG TCGAAGAAGA TGCTTCCATT GTTTGATTCG
ACGAAGGCCG TGAATACATT GCCGCCGGCG ACGGCTGGGT GGCCGGTCTG CCGGGGCTGC
CGGATCGCCT GGTGGGCTAT GCCGTACGGC GCGAATGTCA CTGCGGGTTC GGCGACGGTG
CTCTCCTGTG ACGAGGAGGC GGTTGAACGG GCTTTCGCCG CCGCGGGATG TGTGCGGACG
GCCCGTGTGC GCTCGGTCGG TTTCTCCTCG TTGTCCGCGG ATGCGTCACC GGAGGCGGTC
ACCCTCTGGG TGCTACGTGA TCATGCTTCG AGTAGTCGGC CGGTGGCGGC GACGCTGTGG
AGTTTCAAGA ACGACAATCA GGAGCCCTGG CTGCGGGTGA GTGAGACCCG GATCGCGGTG
GTGGGTTTCC TGCGTGCCCT GCCGGCTGAT CCCGAGGCGC ACCGCGGCTG GCGCACCCTG
CAGCGTCAGT GCAACAGGGT GGATAGGAAA GGCACGGTGG TCCGACTCGG GCGCGACGTC
GTCGCTCGCG CCCTGTTCGA CTCCGTCAAT TTTCCGCCGG ACCAGCTGTG CCGGGAGCTG
TCCCATCAGG TCGACGATCT CGGCCGCGTG TCGGCGTCGA CGATTCGCGC CTGGCGAGCG
CTGTACGCGC TGTATCTGAA GGAGATGTAC GGGATGGACC GCAAAGCCCT TGAACCGGTG
ACGACGCTGC TGGTCGACTG GATCGGGGCG GAGAAGAATC CCCGGGGACG TTTCAACGAG
TACCGGAAGG TCGCGGGACG CGGTTTCGAC CTTCAGGTTC TCCTCATGGG AGCGTGCGCC
CGCCTGTATC TGGACGGCCA GAAGCCGCCG GACGTCACGA GGATCACGGA GCATCTGCTG
GCGTCCGGGC AGGAGGGGCA CCGGCTGCGC GGGCAGTTGT TCTTCGAGGT GGTCGCGGAG
CTGGTCGGAC GCGGCGTGGA GATCGGCGCA CGTACCACCG GGGCGGCGAC CGAGGCGGGG
ACGGAAACAG AGGCCGACGG TCCGCTCGTC GGTGTGGATG CCGACGAGGA CGAGGAGGCG
TACGTCTGA
 
Protein sequence
MIVLARTRLS PRGGLLSLTE HPLQRAGAWA IAVLAGRERP SEVTAADLDE VAGRITADVV 
RAAVAGKDYT AYDWWKVLFA MFPNSEPTHA GRPRDREALI GTVARFFLPD LEGESARPCC
FCGAAASVLW SKKMLPLFDS TKAVNTLPPA TAGWPVCRGC RIAWWAMPYG ANVTAGSATV
LSCDEEAVER AFAAAGCVRT ARVRSVGFSS LSADASPEAV TLWVLRDHAS SSRPVAATLW
SFKNDNQEPW LRVSETRIAV VGFLRALPAD PEAHRGWRTL QRQCNRVDRK GTVVRLGRDV
VARALFDSVN FPPDQLCREL SHQVDDLGRV SASTIRAWRA LYALYLKEMY GMDRKALEPV
TTLLVDWIGA EKNPRGRFNE YRKVAGRGFD LQVLLMGACA RLYLDGQKPP DVTRITEHLL
ASGQEGHRLR GQLFFEVVAE LVGRGVEIGA RTTGAATEAG TETEADGPLV GVDADEDEEA
YV
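
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial code), where GTG can act as an alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first and last codons of the record. It is a hypothetical helper, not the method used to produce the record: the function name `translate` and its `first_is_start` parameter are assumptions, and forcing the first codon to M is a simplification that is only valid when that codon is a genuine initiator.

```python
# Standard codon layout with base order T, C, A, G.
# Table 11 shares its amino acid assignments with the standard code;
# the two tables differ only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, first_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace in the input is ignored. If first_is_start is True, the first
    codon is emitted as M (assumption: it is a valid initiator such as GTG).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break  # stop codon: end of the protein
        if pos == 0 and first_is_start:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = translate("GTGATCGTGC TGGCGCGCAC GCGGCTCTCA")
# Last 9 bases of the gene sequence above, ending in the TGA stop codon.
tail = translate("TACGTCTGA", first_is_start=False)
print(head, tail)  # prints "MIVLARTRLS YV"
```

The two results match the first ten and the last two residues of the protein sequence listed above.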