Gene Franean1_5027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5027 
Symbol 
ID5673365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6021572 
End bp6022699 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content74% 
IMG OID641243881 
Producthypothetical protein 
Protein accessionYP_001509296 
Protein GI158316788 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0475457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.219528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCGC CGAAGAAGCC AGCGAGTGCG GCCACCCTGG CAGCGCTGGG GACGCTGTCC 
GCCGCCACCC GCGGCCGGTG GCGGATCAAG GCTGGGCCGA GCTGGACCGA GCCGCTCGCG
CTCTACGTCG CCGCTGTAGC CGACCCCGGC GAACACTCCG GGACCGTCCT GCGTGCTGTC
GCCGCCCCAC TACTCGAGGC CGAGCGCCTG GCGCACGCCT CGCGCGCCGA TGACCACCGG
AAGGCACTCC GGCGGCACCA CGTCGCCCAG CGCCGGTACG AGACGGCCGT GGAGGCGGCC
GCCCAGGCGC CTGCGGCACA GGCGCGGGAG GCCAACGAGC GGGTGATCGA GGCGATCGAG
GATCTGGCCC GCCATCCCGA GCCCGATGAC CAGCGGATCA CCGTCGGCCA CGTCGAGCCG
AGGGCGCTGC CGGTGCTGCT CGCCGCCGAG GGAGGTGCCC TGGCCCAGCT CGCGACCAGT
GGTGGCCTGC TCGCTGCCAT CGCCGACCCA GGTGACGCCG AAGACGGCTG GGTGGCCCTG
TTCACGCTCC TGCGGGCCTA CGACGGCCAG GCGCTGTCCA TCGACCGGGA GCCCGACCTG
CACGTCCACG TTGAAGAGCC GTTCATGGCT CTCGTTGTGA CCCTCACGCC CGAGGAGATC
GCCAGTCCCA GCGCCCGCCA GCACCTGCGG GGCCTGCTGC CGCGGCTGCT GTTCGCCGCG
CCCGCACCGA TGGCCGGAAC GCGGACGACG GACTCGCCAG CGATGCCAGA CGAGGTGAAC
ACCGTCTGGG CCGGAGCCGT TCGTGGCGTC CTTGACGCCG CGCTGCGCGC CGACGGCATC
ACGATGGTGG AACTGGAGCC GGCCGCCCGG GAGATCTTCG AGGTGTTCCG CGCCGGGTGG
GAACCGCGAC TGCATCCCGA GACCGGCGAC CTCGCCGACA TCCAGACGTG GGCGACCAAG
CATCCGGGGC GGGCCGCGCG CATCGCCGCG CTGCTGGCGC TGGCCGAGGA CCCGGCGACC
ACCACGGTGG GCGTGGAGCA TGTGTGGGCC GCCGTGAACC TGGCTGAGGT CCACATGGCG
CACGCGCGGG TCGCGCTGAC CGGTGCGGGC GTCGAGGGGG CGGCGTGA
 
Protein sequence
MSSPKKPASA ATLAALGTLS AATRGRWRIK AGPSWTEPLA LYVAAVADPG EHSGTVLRAV 
AAPLLEAERL AHASRADDHR KALRRHHVAQ RRYETAVEAA AQAPAAQARE ANERVIEAIE
DLARHPEPDD QRITVGHVEP RALPVLLAAE GGALAQLATS GGLLAAIADP GDAEDGWVAL
FTLLRAYDGQ ALSIDREPDL HVHVEEPFMA LVVTLTPEEI ASPSARQHLR GLLPRLLFAA
PAPMAGTRTT DSPAMPDEVN TVWAGAVRGV LDAALRADGI TMVELEPAAR EIFEVFRAGW
EPRLHPETGD LADIQTWATK HPGRAARIAA LLALAEDPAT TTVGVEHVWA AVNLAEVHMA
HARVALTGAG VEGAA