Gene Elen_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2003 
Symbol 
ID8416314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2347849 
End bp2350452 
Gene Length2604 bp 
Protein Length867 aa 
Translation table11 
GC content69% 
IMG OID645024980 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003182356 
Protein GI257791750 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.228987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000440234 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGAAT TCCTCAAGAA GGCGTCGTGC CGAGGCCGTC GTCCGCTGCA TCTCGTCCCG 
CGGCGCCAGA TACCTCGCCC CAACCTCATC GCTAAGCTTT TGCGCGAGCG GCACGTCGCG
CGGTTCATCG TCGCGCCCGA CGGCTTCGGC AAGACGGGTC TTGCGATGGA ATACGCCGAT
ACGGTGTTCT CGTTCGAGCA TGTGGTCTGG CTCGACGGCC GCAGCCCGTG CTTCTTGCGC
GATCTCGATC GCGGCATCAT AGCGCGCACG CTTTTGGAGG GCGACCCGCA GCCGTTCCTC
GTCGTCATAG ACGATGTGCC GCTGCTCGAT CCTGAGCGCG CCGAGCTGTT ATCCGACGAG
CTCGATCTGC TGCTCGACCG CGATTGCGAG GTGATGGTGG CCTGCATGCC ATCGCGCGAC
GCCTTCGCGC GTCATCGCGA CCGCATCAAG CTCACGGCCG TCGACCTGCT GCTCACCGAT
GAGGAAGTGG ACCTGCTCCG CACACCGGGG GAGCGCTCGA GCGATCCCGC ATCGACGATT
GCGCCCGCGT GCCGCGTCGC CACGTTCGTA TGGGGGTCGG ACGAGGAGCG CGCGGGGTTC
TTGTCCGCCG TGCTTGGCGA GGAGCTTCCG GCCGACCTGC TGCTGCCGCT GTTCGTGATG
GCGTCGCTGG GGAAGGGGAC GTTCGAGGAC GTGGCGGCGT TCGGCCCGTT CGGCCCCGAT
CAGGACGCGC TGCTCGCCGA GCAGTACCCC TATCTCGGCA TCGACTCCCG CAGCGGCCGG
TTCGAGGCCG CGCCGTTCGA GGTCGAAGCC CTCGCCGGCG CGTTTGCGCG CAAGCTGGGC
GCGTTGGCCG ATCGGTCGTT GTTCTCCGAT GCGAGCACGC TCGTCGTGCG CTTGGGCGAT
GCGCTCGCGA TGCGCGGCGA GCACGAACGG GCCTGCGATT TCGTACGTTT GCTGGCGTCG
CGCGCCCCGC GAGCGTCCTG GCTCGCGAAA CACGGGGCGG CGTTGCTCGA AGCCGCTTGC
CTCGTGCCGG CCTGCGAGGT GCACCGCTCC TTATCGGGCG AGACGGCGGG GAAGGGCGCT
CTGCTCAGCG CCCACGAGGC GGTTCGCCGC GCGCTTTTGG GGGATCGACA AGCGGCCTGC
GTTGCCGCGC GCAAGGCGGT GCGCGACAGG AACGCGCCGT CCTCCGTGCG GATGACGGGC
GCGTTGGTGC TCGCGCAATG CGCGGAAGCG GACGAGCGGC GGCGCGCCGA CAAGCTCGTC
GTGTCGCTGC TCGCGGCCGC GGGCATCGCG GATGCGGCCG AAGCGCCCCG TGAGGCCGTC
TGCGCGAAGG CGCCGGAAGA TCGGGGATGG CTCGCTGCGG GATGCGTGCA TGCGCAGCTG
CGGAAAGCGG GCTGCCCCGA GGCTGCTCGG GTATGGCTCG ATTGGCACGA CGACGGCGCG
CGCGGGAGCT TCCTCAAGCA GTCGGCCGCC GAAGTGCTGC GTCAAGCTGC GGCGTCGGGA
GCCGGCGAGG CCCGATCGAT CGAGCTCGAC CGTTTGGGCG CCTTCGTGCG CAAGGAGGTG
TCCCAGGCGA GCCGCGGCAC GCTCGGGCTC GGCGACGCTT TGGCCGGCAT CGCCTACGAG
CGGGCCTGCG AGCGCGGGGC GATCGCCGTT CCCGCGCTCG ACGCGCAGGC GGCCCTCGCC
GTCAGGCGCA TAGAGATGCG CCTGTTCTCC CAGCGCAACG CGTGCGAGCG GCTGGAGCTG
GAGCGATTGG AACGCGAGCG CGTATTCGTC GCCACCCATC CCGACGCATA CCGTGAAGAG
GGTCGCGCGA CTCGCCTGCC GAAGCTGCCG TCCTCCGTGC CCGCGCTCAC CGTCAACCTG
TTCGGGGGCC TCGACGTGCT GATGGGGGAT GAGCGGGTGG ATCCGTCGCT GTTCAGCCGC
CAGAAGGTCA AGACGCTGCT CGCCCTGCTC GTGCTGCACA ACGGCCGCGA GTTCTCGCGC
GACAAGCTGG TGGGTCTTCT GTGGCCCGAC AGCGAGATCA TGCACGGGCG CAAGAACTTC
TACGGCATTT GGGCGATGCT GCGCCGCGCG CTGACGCTGC CGTCGGGCGA GTGCCCCTAT
CTCATTCGCC AGCAGCAGGG GTTGAGACTT GACGCGAGCC TGCTGACGAG CGATGTGGCG
CAGCTCGAGG ACGTGTGCCG CACGCTGCTG TTCGAGCGGC CCGGATACGG CGGCTGGGCG
CAGGTGTACT CGCAGGTCAA CGATCGCTTC TCGGACGATC TTCTGCCCAG CGAGAACGGC
AACGACGCCC TCGCGTCGCT GCGCGTGGAC TACCGCAACC GCCTCGTGGA CGCGCTGGTG
GCCGCCTCGA CGCGCCTCGT CGCTGCAGGC GAGGCCCAGG AGGGACTCTG GTTCGCCCGT
GCGGCGCTCC AGCGCGACCG CTCGCGCGAA GACGCCTACA TCTGCCTCAT GCAGGCCCAG
CTGGCCGCGG GGCAGCGTAC GGCCGCTCTC GAGACCTACT TTGCCTGCCG CCGCTTCCTC
ACCGACGAGC TGGGTATCGA CCCCTCCCTC GAAACGATGC GCCTCTATCG CAGCATCATC
GAAACCGAAA CCGATTTCGA GTGA
 
Protein sequence
MPEFLKKASC RGRRPLHLVP RRQIPRPNLI AKLLRERHVA RFIVAPDGFG KTGLAMEYAD 
TVFSFEHVVW LDGRSPCFLR DLDRGIIART LLEGDPQPFL VVIDDVPLLD PERAELLSDE
LDLLLDRDCE VMVACMPSRD AFARHRDRIK LTAVDLLLTD EEVDLLRTPG ERSSDPASTI
APACRVATFV WGSDEERAGF LSAVLGEELP ADLLLPLFVM ASLGKGTFED VAAFGPFGPD
QDALLAEQYP YLGIDSRSGR FEAAPFEVEA LAGAFARKLG ALADRSLFSD ASTLVVRLGD
ALAMRGEHER ACDFVRLLAS RAPRASWLAK HGAALLEAAC LVPACEVHRS LSGETAGKGA
LLSAHEAVRR ALLGDRQAAC VAARKAVRDR NAPSSVRMTG ALVLAQCAEA DERRRADKLV
VSLLAAAGIA DAAEAPREAV CAKAPEDRGW LAAGCVHAQL RKAGCPEAAR VWLDWHDDGA
RGSFLKQSAA EVLRQAAASG AGEARSIELD RLGAFVRKEV SQASRGTLGL GDALAGIAYE
RACERGAIAV PALDAQAALA VRRIEMRLFS QRNACERLEL ERLERERVFV ATHPDAYREE
GRATRLPKLP SSVPALTVNL FGGLDVLMGD ERVDPSLFSR QKVKTLLALL VLHNGREFSR
DKLVGLLWPD SEIMHGRKNF YGIWAMLRRA LTLPSGECPY LIRQQQGLRL DASLLTSDVA
QLEDVCRTLL FERPGYGGWA QVYSQVNDRF SDDLLPSENG NDALASLRVD YRNRLVDALV
AASTRLVAAG EAQEGLWFAR AALQRDRSRE DAYICLMQAQ LAAGQRTAAL ETYFACRRFL
TDELGIDPSL ETMRLYRSII ETETDFE