Gene Franean1_0869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0869 
Symbol 
ID5669283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1014808 
End bp1015824 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content78% 
IMG OID641239796 
Productputative transcritional regulator 
Protein accessionYP_001505231 
Protein GI158312723 
COG category[S] Function unknown 
COG ID[COG2912] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.202416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.388652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCC ACAGCAGGCG GATGTTCGCC ACCGTCGTCC GCCGCGAACC CGTCGACCTG 
GCACTCGCGT GCCATCTGAT CGCCGCCGAG GCCGGCCCGG AGACGAACCC GGCCGAGACG
ACCCGTGCCC TCGACGCCCT GGCTGCCGAC ACCGCCGCCC TCCTCGCAGC CCGCCGGGCC
GCTCCGCGGG GCCGTGGCGC GGCTGGTGCC GGTACCGACA CCGGTGCCGC TTCCGGGGCC
GGGGCCGCCT CTGAGGCCGT TTCCGGGGCC GGGTCCACTT CCGGGGCGGT TCCCGGGGCC
GCCTCCGGGG CGCCGGCGGA GCGGGGCACC GCGGCCGACG CGGGCACCGA CGCCGGCGGG
GGCACGCTGA CCGGGCTCGA CCTGCGCGCC GCCGCCGAGG CGCTGCGGGA GTCCCTCGGC
GAGCGGGCCG GGTTCGCCGG GCACGAGTCC GACTACGACG ACGTTCGGGC CTCGCTGCTG
CCCGAGGTCA TCAGCCGGCG CCGCGGCCTG CCGATCCTGC TGTCCATCGT GTGGATCGAG
GTCGCGCGCA GAATCGGCGT CCCCGCCTAC GCCGTCGGGC TGCCCGGCCA CGTGATCGTC
GCGGTGGGCG CGCCCCATGA GAACGTCCTC GTCGACCCGT ACGCCGGGGG GGAGATCATG
ACGGTGCACG ACGCGGCGGC ACGGGTGCGC GCGGCCGGCG CGGCGTTCAC CCGGGCCCAG
CTCGCCCCGA TGACCCCGGA CGACCTGCTC ACCAGGGTCC TGAGCAACAT CCGGGTACTC
GCGGCACGCA CCGACGTCCC ACGCACCAGG CTGTGGGCGG TGGAGCTCTC GCTCCTGCTG
CCCCGCCACC CCGCCGTGCT GCGGCGCGAG CGAGGCGAGC TGCGCGTCCG GCTGGGCGAC
TTCCTCGGCG GCGCGGCGGA CCTGACCAAC TTCGCCGACG CGGTCACCAC CGTGGAACCC
GCCGCCGCCG CGGCCGCCCG GCACGCCGCC GCGGCCGCCC GCGCCCGCCT GAACTGA
 
Protein sequence
MSAHSRRMFA TVVRREPVDL ALACHLIAAE AGPETNPAET TRALDALAAD TAALLAARRA 
APRGRGAAGA GTDTGAASGA GAASEAVSGA GSTSGAVPGA ASGAPAERGT AADAGTDAGG
GTLTGLDLRA AAEALRESLG ERAGFAGHES DYDDVRASLL PEVISRRRGL PILLSIVWIE
VARRIGVPAY AVGLPGHVIV AVGAPHENVL VDPYAGGEIM TVHDAAARVR AAGAAFTRAQ
LAPMTPDDLL TRVLSNIRVL AARTDVPRTR LWAVELSLLL PRHPAVLRRE RGELRVRLGD
FLGGAADLTN FADAVTTVEP AAAAAARHAA AAARARLN