Gene Franean1_7303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7303 
Symbol 
ID5675604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8924909 
End bp8926108 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content73% 
IMG OID641246140 
Producthypothetical protein 
Protein accessionYP_001511528 
Protein GI158319020 
COG category[S] Function unknown 
COG ID[COG4301] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03438] probable methyltransferase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0723048 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGA GGACACCGGC TACCGCCCCG GAGACGATTC CCGTGCCCGG CAGGCAGCCG 
CATCCCGCCA CGGCCACCGA CGGGTCCGAC CCGTCGGGCG GGCCCGACAA ACCCGCCCGA
ACGGACCAGA CCCCGCAGAT CACCGTGGAC CGCCACCTCA CCGCCGCCGA ACGGCACGCC
TCGCTCGCCG CGGACATGCG CGCCGGCCTG ACCTCCCACC CTCGTGAGCT GCCACCCAAG
TGGTTCTACG ACGCCACCGG CAGCCTGCTG TTCGACCGGA TCACCCGCCT GCCCGAGTAC
TACCCGACCC GCCGCGAGCA CGCGGTGCTC ACCGCGCACG CCGCCGAGAT CGCCGCCGTC
TGCCCGGCCG GCACCCTCAT CGAGCTCGGC TCCGGCACCT CGGAGAAGAC CCGCCTGCTC
CTCGACGCGC TGCGCGCCAC CGGGGTGCTA CGCCGCTTCG TCCCCTTCGA CGTGGACGAG
GAGACCCTGC TCCAGGCCGG ACAGGACATC CTGCGGGCGT ATCCGGGAAT CTCGGTGCAC
GCGGTGGTCG GGGATTTCGA GCGCCACCTC GGCCTTCTCC CCGGCGCCCG GCCCGCCGCG
GACACGGGCG CGGCGGCGGG CGCCGGTGCT GCCGGCGCTG ATGGCGGCGC TGCCGGTGTT
GATGGCGGCC ACGGTGGAGG CCGCGACGAC CGGCGGCTTG TGGCCTTCCT CGGCGGAACC
ATCGGCAACC TGCGGCCCGC GGCGCGCGCC GCCTTCCTGC GCGCCCTGAG CAACCAGTTC
ACCGACGGCG ACGCCCTGCT CCTCGGCGCC GACCTGGTGA AGGACCCGCG ACGCCTCGTC
GCGGCCTACG ACGACAGCGC CGGCGTGACA GCCGCCTTCA ACCGCAACGT TCTCTCAGTG
ATCAACCGGG AGCTGGGGGC CGACTTCGAC CTGCGCGGGT TCGCCCACGT CGCCGCCTGG
GACGCGGAGA ACTCCTGGAT CGAGATGCGC CTGCGCAGCG TCCGCGAGCA GGAGGTCGGG
GTCCGCGCCC TGGACCTGGT CGCCCGCTTC GACGCCGACG AGCAGATGCG CACCGAGATC
AGCGCCAAGT TCACCCTCGA CGCGATCGCC GCCGAGCTGG CCGCGGCCGG ACTCTCCGTC
AGCCACCAGT GGACGGACCC AGACGGCGAC TTCGCCCTGA CCCTGGCCGT CCCCTCCTGA
 
Protein sequence
MTSRTPATAP ETIPVPGRQP HPATATDGSD PSGGPDKPAR TDQTPQITVD RHLTAAERHA 
SLAADMRAGL TSHPRELPPK WFYDATGSLL FDRITRLPEY YPTRREHAVL TAHAAEIAAV
CPAGTLIELG SGTSEKTRLL LDALRATGVL RRFVPFDVDE ETLLQAGQDI LRAYPGISVH
AVVGDFERHL GLLPGARPAA DTGAAAGAGA AGADGGAAGV DGGHGGGRDD RRLVAFLGGT
IGNLRPAARA AFLRALSNQF TDGDALLLGA DLVKDPRRLV AAYDDSAGVT AAFNRNVLSV
INRELGADFD LRGFAHVAAW DAENSWIEMR LRSVREQEVG VRALDLVARF DADEQMRTEI
SAKFTLDAIA AELAAAGLSV SHQWTDPDGD FALTLAVPS