Gene Franean1_0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0654 
Symbol 
ID5669071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp764443 
End bp765618 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content65% 
IMG OID641239581 
Producthypothetical protein 
Protein accessionYP_001505019 
Protein GI158312511 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.613109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.861609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGTATG GGAACCACCA TGTGTTCCGC CGCTCACGTC TGCGCGGGCG TGACCGGCCT 
GGGGGGTTGG TCGTAACCAA GAAAGGCCTC GAGAGAGGGG CGTCCGTGCG AACGCTCGAA
CCCACCACCC TCGACCTCGA CGCGATCGAG GTGGACAAGC TCAACAAAGG CGTCACGCTC
AGTCTGCTGA TACCCGCTCG CGGGCCTGAG GCGGCGCGGA CGCTGGGAAC CATCGTCGCG
CTCAACCGGC AACGCTGGAT GCAGGAACGC AGTGTCCTCG ACGAGATCGG GGTCATAGTG
GACCCCTCCT CGGACGGAGA CGAGCACGAT CTCGTCCGGA TCGCGACCGA GGCGGGCGCG
AGCTGGGTCG TGCGCGGCCA GTCCGTCCTG GAGCTCCACG CGGGGCGGGA GGCGGCCACG
ACCGGAGGCA AGGCGGGTGC GATGCGCAGT CTCGCCTACC TTGCGTTCGG AAATCGCCTC
ATATTCCATG ACGCGGACCT CGAAAGCTAC GACCCCGCCA CCGTAGGTGT TCTCGCCGCC
GCCGCGACGG CTGGCAACGC CCCGCTTTTC GTGAACGGAA GCTCGCGCAG GGTCACCGGG
GACGGTCAGC CCGGTGGACG TACGACCGAG ATGCTGCGCT CGCTGCTCTC GAAACAACTC
GCGCGCTATG TGCCGTCCAT CACGCGGGCA ATCCAGCCGT TGATCGGGGA GTTCGTCATC
GACGCCGACG TCTTTGCGGC GCTCGCGTTC TCCCGCGGCT ACGGGGTGGA GACCTCGCTG
AAGGTTCTGG CTCTCGACCT TCTCGACTAC TCGGACTGTC TCCAGGTTGA GCTGCCGATC
AAGTATCAGG TTGGTCAGCA CTACCACAAC CTGGTCAAGC AGTTCCATGA GATCAGTTTC
ACAATCGACG TTCTCGAGGC GTACTTCCAA CGCCGCCGGG TCGATCCGCA CGTCGCCATC
TGGCAGATTG CCGACAACTA CGACCTCCTA TTCCCGGGGC GTACGTACGC CCTGCACCGA
CCGCCGGGTT ACGACGACAT CGACTTCGTC CCGCGGCTGG GCTTCTACCA GCCGCTCGCG
ACGTCGTCGG CCTACCAGGC CCGGCTACCC GAGATCAAGA GTGCTCGCCG TGTCGCACTG
GACACGCTGA GTAGGCGGAT GCGGGCCTCC GCCTGA
 
Protein sequence
MLYGNHHVFR RSRLRGRDRP GGLVVTKKGL ERGASVRTLE PTTLDLDAIE VDKLNKGVTL 
SLLIPARGPE AARTLGTIVA LNRQRWMQER SVLDEIGVIV DPSSDGDEHD LVRIATEAGA
SWVVRGQSVL ELHAGREAAT TGGKAGAMRS LAYLAFGNRL IFHDADLESY DPATVGVLAA
AATAGNAPLF VNGSSRRVTG DGQPGGRTTE MLRSLLSKQL ARYVPSITRA IQPLIGEFVI
DADVFAALAF SRGYGVETSL KVLALDLLDY SDCLQVELPI KYQVGQHYHN LVKQFHEISF
TIDVLEAYFQ RRRVDPHVAI WQIADNYDLL FPGRTYALHR PPGYDDIDFV PRLGFYQPLA
TSSAYQARLP EIKSARRVAL DTLSRRMRAS A