Gene Franean1_2282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2282 
Symbol 
ID5670681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2726183 
End bp2727367 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content67% 
IMG OID641241202 
Producthypothetical protein 
Protein accessionYP_001506623 
Protein GI158314115 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.212652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAA CAAAACTGGG CGTGGTGGCC CTGGGCGCGA CGGCGGCACT GGCACTGGCG 
GCCTGCTCCG GTGACTCGAC GACGGGCGAG AGTAGTTCCG GCGGTGAGCG GGCGCCGGCG
GTCGAGGCGG GTGGCGCCCT CGACCTCGCC GGGGTCTGCC CGGAGAACGT GGTGATCCAG
ACGGACTGGT TCGCCGAGTC CGAGTACGGC TTCCTCTACA ACCTGATCGG CCCGGACGCC
AAGATCGACA CCGGTGGCAA GCGCATCACC GGGTCGCTGG TCGCCCAGGG CAAGGACACG
GGTGTCAACG TCGAGGTGCG CTTCGGCGGG CCGGCCATCG GCTTCGAGCA GGTCAGCTCC
CAGCTCTACC TTGACCCGGA GATCGACCTG GGCCTGGTCT CCTCGGACGA GGCGATCCAG
AACTCCAAGG ACCAGCCGAC CACGGCCGTC TTCGCCCCGT TCGAGGTCAG CCCCATTATG
ATCATGTGGG ACAAGGTGAA GAACCCGAAC TTCCACACCC TGGTCGACAT CGGCCAGACC
GACACGAAGG TCCTGTACTA CGAGACCGAC ACCTACATGC AGTACCTGCT CGGCGCGGGC
ATCCTGCGGG CGTCCCAGGT CGACGGCAGT TACGACGGCG GCCCGTCGCG CTGGGTGACC
GAGGACGGCG CCGTCGCGCA GGGCGGGTTC GCCACCTCCG AGCCCTACAT CTACAAGAAC
GAGCTGGATG ACGGGCGCAG CTACGACGTC GACCTCCAGC TGATCAACGA CACCGGGTAC
CCGGTCTACG GTCAGGCGCT GTCGATCCGC TCCGGTGACA AGGAGACCCT CGCGCCCTGC
CTGAAGAAGC TGATCCCGAT CGTCCAGCAG TCGCAGGTCG ACTTCGTGAG CGACCCGGCC
GAGACCAACG CCCTGATCAT CAAGGCCGTG CAGGCCGACG ACGTCTCGGT GTGGAACTAC
TCGCCGGGCC TGGCCGACTT CGCCGTCACC ACGATGAAGG AGCGCGGCCT GGTTGCCAAC
GGGCCGAACG CGGCCGTCGG CGACATGGAG GAGGACCGGC TGGCCCGGAT GATCGAGATC
CTCGAGCCGA TCTTCACCGG CCAGCGCAAG GAGCTCAAGG CCGGCCTGGC ACCCGGTGAC
CTGTTCACGA ACGAGTTCAT CAACACCTCG ATCGGCCTCA AGTGA
 
Protein sequence
MRRTKLGVVA LGATAALALA ACSGDSTTGE SSSGGERAPA VEAGGALDLA GVCPENVVIQ 
TDWFAESEYG FLYNLIGPDA KIDTGGKRIT GSLVAQGKDT GVNVEVRFGG PAIGFEQVSS
QLYLDPEIDL GLVSSDEAIQ NSKDQPTTAV FAPFEVSPIM IMWDKVKNPN FHTLVDIGQT
DTKVLYYETD TYMQYLLGAG ILRASQVDGS YDGGPSRWVT EDGAVAQGGF ATSEPYIYKN
ELDDGRSYDV DLQLINDTGY PVYGQALSIR SGDKETLAPC LKKLIPIVQQ SQVDFVSDPA
ETNALIIKAV QADDVSVWNY SPGLADFAVT TMKERGLVAN GPNAAVGDME EDRLARMIEI
LEPIFTGQRK ELKAGLAPGD LFTNEFINTS IGLK