Gene Franean1_2003 details

Gene Information

Locus tag: Franean1_2003
Symbol:
ID: 5670404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2408652
End bp: 2409680
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 74%
IMG OID: 641240924
Product: AraC family transcriptional regulator
Protein accession: YP_001506346
Protein GI: 158313838
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
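
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2408652, 2409680)
print(length)                     # 1029
print(protein_length_aa(length))  # 342
```

Both results match the Gene Length and Protein Length fields of this record.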


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0833293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACCGG CGGGATCACC CGTTGGCGGG AAACAGTCGC CTGGTTATCG TCCTGCCATG 
CACACCGTGG CGGTTCTCGC CCTCGAGGGC GTGGTCCCCT TCGATCTCGC GGTGCCCGTC
GACACCTTCG GCCGTGCCCG GCTCCCCGAC GGCCGGGCGG CCTATCGGGT GAAGATCTGC
GCTGGTCGGA GCATCAGCGG CACCGTCGAG GCCGACGGCG GGGCGTTCTC CCTGCGGGCC
CCCTGGGGGC TGGACGCGCT CGCCGAGGCC GACACGATCG TCGTCCCCGG CGTCGGCGAG
CAGGCCGGAC CCCTGCCCCC GGAGTCCGTC ACCGCGTTGC GGGCCGCCGC CGCGCGCGGC
ACCCGCATCG CCTCCATCTG CGTCGGCGCG TTCCTGCTCG CGCAGACCGG CCTGCTGGAC
GGCCTGCGCG CGACCACGCA CTGGATCGCC GCCGAGGAGC TCGCCCGCCG TCATCCCGCC
GTGCGGGTGG ACCCGAACGT GCTGTTCGTC GACAACGGGC AGATCCTGAC CTCGGCCGGC
GCCGCGGCCG GCCTCGACCT GTGCCTGCAC ATGATCCGCT CCGACCATGG GGTCGCCGTG
GCCGCGGACG TCGCCCGGCT GTCGGTCGTC CCGCTGGCCA GGGAAGGTGG CCAGGCCCAG
TTCATCGTCC GCGACCGCCC GCCGCCGGAC GGCTCCGTCC TCGAACCACT GTTGCGGTGG
ATGGAGGCGA ACTGCCACCG CCCGTTGACG GTGGACGACC TCGCCGCCCA GGCGATGACG
AGCCCACGCA CGCTCAACCG CCGCTTCCGC GAACAGACGG GCACGACCCC GTCCCAGTGG
CTGCACCGGG TGCGCCTGCG GCAGGCGCAG TACCTGCTTG AGACCACCGG GCACTCTGTG
GAGCGGATCG CCGCGCAGGT CGGCTTCGGA TCGCCCACCG CCTTCCGGGA CGGGTTTCGC
CGGCTGGTCG GCACCAGCCC CCAGGCCTAC CGCCGGGCCT TCCGTGACGC TGTCACGCCT
CCCGGGTAG
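
The GC content reported for this gene is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the helper name `gc_percent` is hypothetical, and the example runs on the first row of the sequence only, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored so the 10-base blocks used on this page can be pasted as-is.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "GTGGGACCGG CGGGATCACC CGTTGGCGGG AAACAGTCGC CTGGTTATCG TCCTGCCATG"
print(round(gc_percent(first_row), 1))
```

Passing the full 1029-base sequence instead of `first_row` gives the whole-gene value to compare against the GC content field.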
 
Protein sequence
MGPAGSPVGG KQSPGYRPAM HTVAVLALEG VVPFDLAVPV DTFGRARLPD GRAAYRVKIC 
AGRSISGTVE ADGGAFSLRA PWGLDALAEA DTIVVPGVGE QAGPLPPESV TALRAAAARG
TRIASICVGA FLLAQTGLLD GLRATTHWIA AEELARRHPA VRVDPNVLFV DNGQILTSAG
AAAGLDLCLH MIRSDHGVAV AADVARLSVV PLAREGGQAQ FIVRDRPPPD GSVLEPLLRW
MEANCHRPLT VDDLAAQAMT SPRTLNRRFR EQTGTTPSQW LHRVRLRQAQ YLLETTGHSV
ERIAAQVGFG SPTAFRDGFR RLVGTSPQAY RRAFRDAVTP PG
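
The protein sequence follows from the gene sequence under translation table 11, in which alternative start codons such as GTG are read as methionine when they initiate translation. The sketch below is a plain-Python illustration under that assumption, not the pipeline used to produce this record; `translate_cds` and the constants are hypothetical names.

```python
# Codon table 11 uses the same codon-to-residue mapping as the standard code,
# listed here in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11; each is translated as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS that begins at its start codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [bases[i:i + 3] for i in range(0, len(bases), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    residues = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGACCGG CGGGATCACC CGTTGGCGGG"))  # MGPAGSPVGG
```

The output matches the first ten residues of the protein sequence listed above.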