Gene Franean1_3682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3682 
Symbol 
ID5672048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4359147 
End bp4360769 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content73% 
IMG OID641242565 
ProductTetR family transcriptional regulator 
Protein accessionYP_001507985 
Protein GI158315477 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0191871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCCA GGGCGGGTGG CGAGCGTTCG CGGGCCGGGC GCCCGACGGC GACGGAGGCA 
GCCAGGCTGA CCGAGCGGCT CCGGCGGGCC GCCGTCGACA CGTTCCTGGA GTCGGGTTAC
GACGGCACCA CGATGGAGGC CGTCGCGCAG GCCGCGGGGA TCACCAAGAG CACCTTGTAT
GCCCGCTATC CCGACAAGCG AACTCTGTTC ATCGCGGTGA GCTCGTGGGC GCTCACCCGC
CAGGAGCGCG ACGAGCGCGT CCTCGAACCG CTGCCCGACG ACCTCGCCGA GAGCCTGACC
GTGATCGCGC GGGCGATCCT GGCCCGCTCG GTCGACCCCG ACATCGTCCG CCTGAGCCGG
ATGGCGATCG CCGAGTCGGC GCGCTTCCCC GAGTTCGCGG CGAGCTCGCA GGCCGTCACC
TGGTCACCGC GGGTGCAACT GATCATTGAT CTGCTACGCA GGCACGAGAG CGCGGGCACC
GTGGTCGTCG GGGACGTCGA CCTCGCCGCC GAGCAGTTCT TCGCCATGGT CGGCGCGATG
CCGGCGTGGC TGGCCGCCTA CGGCATCTAC CGGACCCCCG AGGTCGAGGA GGAGCACCTC
CACCACGCGG TGAGCCTCTT CCTCAACGGC GTCCTGGCCA GGCCGGAGAC CATGCCGGCC
CCGCACCCGA CCGAGGCCGC CAGGCCCGCC GGGGAGGGAC CGGCCGTCCC TCGTGCGAGC
TGGCCGATGG AGTACCGGCG GCTCGGCCGC AGCGGGCTGA ACGTCTCGCG GCTCGCGCTG
GGAGTCGCGG CTCTCGGCGC CGGGCCCGCG GACCATGACG AGCGGGCCGC GATCGGCGTC
ATCCACCGCT TCCTCGACGC CGGCGGCAAC CTCCTCGACA CCACCGCCGC GGGCGACATC
GGCCGGGGTG GCACGGACCC GGACGGCGCC GCGGCGGAGC TCTGCGGCCG TGCCGTTCGG
GACAGGCGCT CGAGCGTCGT CCTGGCCTCG AGCGTCGGGC GGCCGAGCGG GCAGGGCCCC
CACGACGGCG GGAACAGCCG CCGTCACATC CGGGCCGCCT GCGAGGCGAC CCTCCGCCGG
CTCCGGACGG ACTATCTCGA CCTCCTCCAA CTCGACGCCG ACGACCCGAC GACCCCGCTG
GAGGAGACGA TCGACGCGCT GGACGACCTG GTGCGCGCCG GGAAGGTCCT CTACGTCGGC
GTCGCCAACC TGCACGTCTA CCGGGTGACG AAGGCGCTGT CGGTCAGCGA CCGGCTCGGC
CGGGCCCGTT TCATCTCGTT CCGCGGCCCG TACGGCCTGC TCTCGCGGGA GCTCGAACAC
GAGCACCTCC CGCTGCTGGC GGAGGAAGGC CTCGGCCTGA TCAGCACGAG CTCACTCCGC
TCCCCCGGGC ACGGGCACGG GCACGGGCAC GCCACTGTCG CCGCCACGGA GGCCGCGGCG
GCGGAGCTCG GGTGCACGAC CACGCAGCTG TCGCTGGCCT GGCAACTGAC GAGATCCGTC
ACCTCGATCA CGCTCGACGT CGCCTCCGCG GCCCAGCTGG ACGAGCACCT CGCGGCCCTG
GGCATCGAGA TCCCCACCGA GATCGCGGCA GCACTGGAAC AGGTCTCCCG TCCCCAGGGG
TAA
 
Protein sequence
MESRAGGERS RAGRPTATEA ARLTERLRRA AVDTFLESGY DGTTMEAVAQ AAGITKSTLY 
ARYPDKRTLF IAVSSWALTR QERDERVLEP LPDDLAESLT VIARAILARS VDPDIVRLSR
MAIAESARFP EFAASSQAVT WSPRVQLIID LLRRHESAGT VVVGDVDLAA EQFFAMVGAM
PAWLAAYGIY RTPEVEEEHL HHAVSLFLNG VLARPETMPA PHPTEAARPA GEGPAVPRAS
WPMEYRRLGR SGLNVSRLAL GVAALGAGPA DHDERAAIGV IHRFLDAGGN LLDTTAAGDI
GRGGTDPDGA AAELCGRAVR DRRSSVVLAS SVGRPSGQGP HDGGNSRRHI RAACEATLRR
LRTDYLDLLQ LDADDPTTPL EETIDALDDL VRAGKVLYVG VANLHVYRVT KALSVSDRLG
RARFISFRGP YGLLSRELEH EHLPLLAEEG LGLISTSSLR SPGHGHGHGH ATVAATEAAA
AELGCTTTQL SLAWQLTRSV TSITLDVASA AQLDEHLAAL GIEIPTEIAA ALEQVSRPQG