Gene Franean1_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0821 
Symbol 
ID5669237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp959033 
End bp960151 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content75% 
IMG OID641239750 
Productsignal peptide 
Protein accessionYP_001505185 
Protein GI158312677 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.582134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGCAG AGCGCGGACG CGGCGACGGA CTCGGAACCC ACCGGCGCCG GCGGTGGGTC 
GCCGCGGCCG CGGTGGCGGC GGCACTATGC TCGGTCGCCC CGACCGTCCC GACGCTGGCG
GCCGCCGCCG CCGGTGACCT CCCGGCCGGC GGTGCCACCG AAACCGTCCC GCCCGCGCCC
TTCTCGGACC TGCCCGGCCC CTTCGCCGGC CCGTACTCCG AACCGCTCGG ACGTCCCTAC
GCCGAGCCGA TGTCGGCGCC GGTCGCGGCG AGCCCCTACG AGCAGTCCCG GATCGAGCGG
TACTGGTCCG CCGACCGGCG CGCCCGGGCG CAGACAGCGG ACACGGCCGA GCGCGGCGAC
CGGCCGGCTC CGTCCGCGCT GGACGGGGCC GCCACGGCCG GCGAGGACCA GCGCGACGCC
ACGCCGCCGC CGCCGAGCAC CGGCGCCCCC TACGTCTACG GCGGGCTGGC GACGAAGACG
GTCGGCCGGT TGTTCACCAC GCTGCGCGGC GTCGACTACG CCTGCTCGGC GACGGTCGTG
TCCAGCCCGG GCCGCGACCT GGCCGTGACC GCGGGCCACT GCCTGCACGA GGGCACGGGC
GACCAGTTCG CGACGAACGT CGTCTTCATG CCCGGATACT CCGAGGGACG GATGCCGTAC
GGGCTGTGGA CGGCCCGCCG GATCACTGTC ACCCCGGGCT GGGGCCTGGA CGGGGACTTC
GACTACGACA CCGGGTTCGT CCTGTTCAAC GCGCGCGGGG GCCGTCACCT CGAGGACGTC
GTCGGCGCCC AGCGCATCGC CTTCAACCAG CCCCGCACCT TCGCGCAGTA CGCGTTCGGA
TACCCGCGGC TGGCGCCCTA CGACGGCAAC CGGCTGGTCT ACTGCGCCGG GGCACCCTCC
CCCGACCCGT ACGGGACGGT GTCGCTCGGG CTGAACTGCG ACATGACCGG TGGCGCCAGC
GGCGGACCGC TGATCATCGG GCTCGGCCGG GCCGGGCCCG GGGCCGGCTG GGTCGACAGC
GTGGTCAGCT ACGCCTACGT CGGCGAGTCG CAGACCATCT ACGGCACCTA CTTCGGGCGG
GCGATCGAGC TGCTGTACTA CCAGGCGATG GAACTCTGA
 
Protein sequence
MVAERGRGDG LGTHRRRRWV AAAAVAAALC SVAPTVPTLA AAAAGDLPAG GATETVPPAP 
FSDLPGPFAG PYSEPLGRPY AEPMSAPVAA SPYEQSRIER YWSADRRARA QTADTAERGD
RPAPSALDGA ATAGEDQRDA TPPPPSTGAP YVYGGLATKT VGRLFTTLRG VDYACSATVV
SSPGRDLAVT AGHCLHEGTG DQFATNVVFM PGYSEGRMPY GLWTARRITV TPGWGLDGDF
DYDTGFVLFN ARGGRHLEDV VGAQRIAFNQ PRTFAQYAFG YPRLAPYDGN RLVYCAGAPS
PDPYGTVSLG LNCDMTGGAS GGPLIIGLGR AGPGAGWVDS VVSYAYVGES QTIYGTYFGR
AIELLYYQAM EL