Gene Franean1_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0053 
Symbol 
ID5668479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp66165 
End bp67283 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content69% 
IMG OID641238982 
Producthypothetical protein 
Protein accessionYP_001504427 
Protein GI158311919 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCG TAGCCCTGGC TTTGGCCTGC TTCTTCCCCT TGATCGGCCG TCCCCGCCTG 
GCCTGTCAGC CTCTGCCCGA TCGAGTTGCG GAGATCGCCG AGATCGCCCA AGCGGCGGCC
CGGGACGGTG CGGACGGACT GGCCGAAGGA GCACATGCCC TGAACAAGGC GGCGCTGCTC
GCCAGCGACT GCGGTCTGGC GCCCCTCGCC CGTGACCTGT GCTGGCAGCA CATCAACATC
TACTGCGCTG TCCCGCGCCC GCTCACCGTC CACGAGGCCC GATACATGCT CGAACCGGCC
CTCAACCTCG CCCGGCTCCA GATCCGCGCC AGCGACGGGG AACAGGCGCT CGGGCTACTC
ACCGCGATGT TCCAGGCCGT CTCGTCGAAC ACCGACCTGG TCGTCGACGG CCGGGTCCTT
CCGCTGACTG ACCTCATCGG CACCCGCGAC GAGCGCCACA AGCTGCGCGA ATGGGTGTGG
CTGCACCTCG TCGGGGACGG CGTCCGTGCC CTGGCACTCG CCGGCCGCTG GGACGATGCG
GTCATCCACG CAGACACGTA CCGAGGGATC GGACTGCATC TCCTGGAAGG CCGCCAGGCG
AAGATCCTCG CTCACTGCCT GACCGGGACG TCAGCCGAGG CCCGCGCGGC CCTGGCGGAG
AGCACGCCGA TGTACCCGTG GGAGCTCCAG GTCGCCTCGT GCCTGGAGGT GATGTGCACC
GAGGACACAT CCACAGCACA CGGTGTCACC ACCATGATCG GGCAGTTCCT GGGACAACGA
CCGATGCCCG GCTACGCGGT CTTCCGTGCC CACCTCGGCA TGACCGTAGC CGCTCTCGCC
GCCACCACCG ACCCAGACGC CGCCACCCGC GTTCTCACCC AGACAGTCGA GGAAGTGATC
GAAGCCGAGG ACGGGTACGC GGCACGGGAC GTTCTCCGGC TTCGCCCCAC ACAAGCGGTC
GACCTGCCAG CCAGGCACGA AAAGGCGCTC GCCGACCTAC TCAACGCCTC CGGCCTACGA
GCAGAAACAC CGCCGGAACC GGTCCTGGAG TCTGTTCTCG GCTCCGCCCG GACCGCCGAA
GCCGCGATCG TCGCGGCGAC ACACCCCCAG CGACGATGA
 
Protein sequence
MNPVALALAC FFPLIGRPRL ACQPLPDRVA EIAEIAQAAA RDGADGLAEG AHALNKAALL 
ASDCGLAPLA RDLCWQHINI YCAVPRPLTV HEARYMLEPA LNLARLQIRA SDGEQALGLL
TAMFQAVSSN TDLVVDGRVL PLTDLIGTRD ERHKLREWVW LHLVGDGVRA LALAGRWDDA
VIHADTYRGI GLHLLEGRQA KILAHCLTGT SAEARAALAE STPMYPWELQ VASCLEVMCT
EDTSTAHGVT TMIGQFLGQR PMPGYAVFRA HLGMTVAALA ATTDPDAATR VLTQTVEEVI
EAEDGYAARD VLRLRPTQAV DLPARHEKAL ADLLNASGLR AETPPEPVLE SVLGSARTAE
AAIVAATHPQ RR