Gene Franean1_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3454 
Symbol 
ID5671825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4083569 
End bp4084990 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID641242342 
Producthypothetical protein 
Protein accessionYP_001507762 
Protein GI158315254 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGAC AGCGGGATCA GGTGGCTGCC ATAGCAGTCG AAGCGTACAT TTTCGCCTAT 
CCGCTCGTGA CCATGGAGCT GACCCGACTC CAAGCCACCA ACGTCGAGCC CGGAGTGGCG
CCGGGCCGGG CCCCGATGAA CCAGTTCGCA CATATTCGGG AGTTTCCCGA CGCCGATTTC
AGGATGGTCG TCCGGCCGAA CTTCGACACC CTCTACTCCT CGGCCTGGGT GGATCTGACC
GAAGGACCGG TGGTGGTCTC CGCACCCGAC ACGGATAATC GCTACTACAT GTTGCCCATT
CTCGACATGT GGACGGATGT CTTCGCCACC CCCGGAAAGC GCTCCAGCGG CACGGCCGCG
GCGGACTGGG CGCTGGTGCC GGCCGGGTGG AGCGGGCGCC TGCCGGCGGG CGTGGGACGC
ATCGACGCTC CGACTCCGCA CGTCTGGATC ATCGGCCGGA CGCAGACCAA CGGCGAGGCC
GACTACGACA CCGTCCACAA GGTGCAGGAC GGATTCCAGC TCTCCCACCT CGCGGACTGG
GGGCGCGCTC CGATCGCCGC GACCGCCCGG GCTGTCGACC CGGACATCGA CATGACGACG
CCGCCCCTGG ACGTCATCAA CGCCATGACC GGCGAGGAGT TCTTCAGGCG CGCGGCAGAG
CTGATGAAGC TTCACCCGCC ACATGTCACG GACTGGTCAC AGATCCGGAG AATGCGCGCG
CTCGGCCTGG TTCCCGGCGA GTCCTTCGAC CCGAACCGCC AGGGCCGGGC CGTTCGGGAT
GCCGTCGCAG CGGCGCCCCG GACCGCTCAG AAAGCGATGA CCACGCGAGT TTCGACAATA
GCGACCGTGT CCGACGGATG GCAGACCAAC ACGGACTCGA TCGGCGTCTA CGGCAACTAC
TACATGAAAC GGGCCGCCGT CGCGATGATC GGTCTCGGCG CCAACCCCGC AGAGGAAGCC
GTCTACCCGC TGCTGCTCAC TGACGCGGAC GGCGACCCGC TCGACGGATC CGTCGACTAC
GTGCTCCACT TCGAGCGCGA CGAGCTCCCT CCGGTCTCCG CGTTCTGGTC GATCACGATG
TACGACGAAC GCGGCTTCCA GGTGGCCAAC CGGCTCAACC GGTTCGCCCT CGGAGACAGG
GATCCGCTGA CGTACAACGC TGACGGATCG CTCGATCTCC ATATCCAGAT GCGTCCCCCG
GATCCGTTCG GGAATCGAAC TGGCTGCCGG CCCCGCTCGG CCCGCTGGGT GTCACGATGC
GGCTCTACGC ACCCGACCCC GCGGTCCTGT GCGGAGCATG GTCACCGCCC CCGGTACGGA
AGGCCGCGAG CCGCCCCGGC TGACAGCTCC CCGGCGGATC CCACCGACGG GCCGAAGGTC
CGGCGTCCCA GGCTTCGGCC CCCGGCAGGA TCGTTGTCGT GA
 
Protein sequence
MTGQRDQVAA IAVEAYIFAY PLVTMELTRL QATNVEPGVA PGRAPMNQFA HIREFPDADF 
RMVVRPNFDT LYSSAWVDLT EGPVVVSAPD TDNRYYMLPI LDMWTDVFAT PGKRSSGTAA
ADWALVPAGW SGRLPAGVGR IDAPTPHVWI IGRTQTNGEA DYDTVHKVQD GFQLSHLADW
GRAPIAATAR AVDPDIDMTT PPLDVINAMT GEEFFRRAAE LMKLHPPHVT DWSQIRRMRA
LGLVPGESFD PNRQGRAVRD AVAAAPRTAQ KAMTTRVSTI ATVSDGWQTN TDSIGVYGNY
YMKRAAVAMI GLGANPAEEA VYPLLLTDAD GDPLDGSVDY VLHFERDELP PVSAFWSITM
YDERGFQVAN RLNRFALGDR DPLTYNADGS LDLHIQMRPP DPFGNRTGCR PRSARWVSRC
GSTHPTPRSC AEHGHRPRYG RPRAAPADSS PADPTDGPKV RRPRLRPPAG SLS