Gene Franean1_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3848 
Symbol 
ID5672211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4570873 
End bp4571874 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content77% 
IMG OID641242726 
ProductAraC family transcriptional regulator 
Protein accessionYP_001508146 
Protein GI158315638 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0846255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.776114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCC TGGCCGGAAT GCTGGACGGG CCGCGGGCGC GGGGGGCGTT CACGATCCGC 
TCGGTCATGT CCCCGCCCTG GTCGATGCTC ATCAGGGACC AGGCGCCGCT GACCGTCGTC
GCGGTCGTGC ACGGCGAGGC CTGGGTGCTC CCCCGGGACA CCCCCGCGGT GCAGCTGCGC
GCGGGCGACG TGGCCGTCGC CCGCGGGCCC GACCCGTACG TCGTCGCCGA CGACCCGGCT
ACCCCGCCGC GGATCGTGAT CCATCCCGGG CAGGTCTGCA CCACGCTGTC GGGTGCGAGC
CTCGCCGCGC AGATGGATCT CGGCGTGCGC ACCTGGGGCA ACCAGCGCGA CGGCGCGACG
GCCCCGCCGA CGACGATGCT CACCGGCACG TACCAGCAGC ACAGCGAGGT CAGCCGCCGG
CTGCTCGACG CGCTGCCCGC GCTGGCCGTC ATCCGCGCGG ACGACTGGGA CTGCCCGCTG
GTACCCATGC TGGCCCAGGA GATCGGCCGG GACGACCCGG GCCAGGCCGC CGTCCTCGAC
CGCCTCCTGG ACCTGCTGCT CGTCACCGCG GTGCGGGCCT GGTTCGCCCG CCCCGACACC
GACGCGCCGC CGTGGTGGCG GGCCAACGGC GACCCCGTCG TCGGCCACGC GCTGCGGCTG
CTGCACAACC ACCCCGAGCG CCCCTGGACG ATCGCCGCCC TCGCCGCGGC CACGGGCGTC
TCCCGCGCCT CGTTCGCGCG CCGGTTCGCC AGCCTGGTCG GCGAGCCGCC CATCGCGTTC
CTGACCGGCT GGCGTCTCAC CCTGGCCGCC GACCTGCTCC AGGAGCCGGC GGCCACGGTC
GGCGCGGTGG CCCGCCAGGT CGGCTACGGC AGCCCGTTCG CCCTCAGCAC GGCGTTCCGC
CGCCGGTACG GCGTCAGTCC GCAGCAGTAC CGCGCCCGCG CCCACGGCGA CCGGGCGGAC
CAGCCGCCGT CCGACGGCCG CCCGGCGGAC GCGCCCGGAT GA
 
Protein sequence
MDVLAGMLDG PRARGAFTIR SVMSPPWSML IRDQAPLTVV AVVHGEAWVL PRDTPAVQLR 
AGDVAVARGP DPYVVADDPA TPPRIVIHPG QVCTTLSGAS LAAQMDLGVR TWGNQRDGAT
APPTTMLTGT YQQHSEVSRR LLDALPALAV IRADDWDCPL VPMLAQEIGR DDPGQAAVLD
RLLDLLLVTA VRAWFARPDT DAPPWWRANG DPVVGHALRL LHNHPERPWT IAALAAATGV
SRASFARRFA SLVGEPPIAF LTGWRLTLAA DLLQEPAATV GAVARQVGYG SPFALSTAFR
RRYGVSPQQY RARAHGDRAD QPPSDGRPAD APG