Gene Franean1_1604 details

Gene Information

Locus tag: Franean1_1604
Symbol:
ID: 5670007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1920069
End bp: 1921133
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 75%
IMG OID: 641240523
Product: AraC family transcriptional regulator
Protein accession: YP_001505949
Protein GI: 158313441
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
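
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names `span_length` and `expected_protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1920069
end_bp = 1921133
gene_length_bp = 1065
protein_length_aa = 354


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions the reported gene length (1065 bp) and protein length (354 aa) agree with the coordinates.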


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGCGACGG TTCTTGATCT CGACCACCTG CCGGCGCACC GGCGCGCCGA GGTCGCCCGG 
GACGCCATCG TCGCGCTCAC CGTCACCAGC CGGTTCAGCG TGCACGCCCA GCACGACATC
CGGGGCCGGA TCGAGTCGTG GATCTTCGGC GAGGTCGGCC TGATCCGCAC GACCGTCAAC
GCCGGCCAGC ACATGATCCG CACCGCCCGG CACGTGCGCC AGGACAGCGC TCCCCCGGTG
CTGTCCTTCG CGCAGCGCCG GCACGGGCGC GCCGTCCAGG AGCAGTTCGG GGTCCGCCGC
GAGCTGGCCG CCGGCGAGCT GTACTGCACC GACCTGAGCT CCGCGTTCGA GTACGCCAAC
GCCGAGGGCG GATGCGGCCA GGGCCTGCAG GTACCGCTGG CCTCGCTCGG CCTGCCCGGC
GAGGTCGTCC GGCGCGGCGC CCCGAACCTC GCCCGCAGCC CGCTCTACCC GCTGGTCACC
GACCATCTCA CCGCGCTGGC CCGCGACGCC GCGGCCCTCG AGCGGGACGA GTCCGCCGCG
GGCGTCGGCA CCGCGACCGT CGAGCTCGTC CGGGCGTTAC TGGTGAGCGC GGCCCGCGGC
CCCGACGAGC CCCCGGGAAC ACCGGCGGAG ATCATGCTGG CCCGCGTGCG GCACCACGTG
CTGGCCCACC TGGCCGATTC CGATCTGAAC GCGGACGGCA TCGCGGCCGC GGTCGGGGTG
TCCGTGCGTC ACCTCTACCG GCTGTGCCGG GAGGCGGAGT TCAGCCTGGA ACAGTGGATC
GTGCGCAACC GGCTGGAACG GGCCCGCGGC GCGCTCGCGG CCCGGGACGC GCGCGGCCGC
AGCATCGCCG CCATCGCGCG CGCCAACGGC TTCGCCGACC CCTCCCACTT CAGCCGCCGC
TTCCGGGCTG CCTACGGCAC CACCCCCCAG GAATGGCGCC GCCACCACAC CCCCCACCAC
ACCTCCCCGC CGCAAACCCC CACCCCGGAG CCCGGGTCGG GGCCCGCCCC CAACCCGGAC
TCAAGGTCAT CTCGCGGTGC CGGAAGGTGC GTGGCGGGTG AGTGA
 
Protein sequence
MATVLDLDHL PAHRRAEVAR DAIVALTVTS RFSVHAQHDI RGRIESWIFG EVGLIRTTVN 
AGQHMIRTAR HVRQDSAPPV LSFAQRRHGR AVQEQFGVRR ELAAGELYCT DLSSAFEYAN
AEGGCGQGLQ VPLASLGLPG EVVRRGAPNL ARSPLYPLVT DHLTALARDA AALERDESAA
GVGTATVELV RALLVSAARG PDEPPGTPAE IMLARVRHHV LAHLADSDLN ADGIAAAVGV
SVRHLYRLCR EAEFSLEQWI VRNRLERARG ALAARDARGR SIAAIARANG FADPSHFSRR
FRAAYGTTPQ EWRRHHTPHH TSPPQTPTPE PGSGPAPNPD SRSSRGAGRC VAGE
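
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below is a rough illustration, not the database's own pipeline: it assumes translation table 11 (as listed in this record) uses the standard codon assignments with the alternative start codons TTG, CTG, ATT, ATC, ATA, ATG and GTG read as methionine at the first position. The functions `translate` and `gc_content` are hypothetical helpers, and only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); table 11 shares
# these codon assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and the first 20 residues of the protein.
gene_prefix = "GTGGCGACGG TTCTTGATCT CGACCACCTG CCGGCGCACC GGCGCGCCGA GGTCGCCCGG"
protein_prefix = "MATVLDLDHL PAHRRAEVAR"

print(translate(gene_prefix) == clean(protein_prefix))  # prints: True
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow the reported protein sequence and GC content to be checked against the record.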