Gene Elen_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2066 
Symbol 
ID8416383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2432688 
End bp2433767 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content71% 
IMG OID645025048 
Productputative transcriptional regulator, AsnC family 
Protein accessionYP_003182418 
Protein GI257791812 
COG category[K] Transcription 
COG ID[COG1522] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.348008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0096425 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGCTG AGGCGCGCGG CACGCTGGGC GTCCACCTCG ATTCGCCCGT CGCCCGCCGC 
GTGCTCACGC GCGTGCAGCA GGAGCTTCCC GTGTGCGAGC GCCCCTACGC GGCGCTGGGG
GATGCGTGCG GCACGTCGGA GGAGCAGGCG TTCGCCGCGG TGGAGGCTGC GCGCACGGCC
GACATCGTCC GACGCATCGG CGCCAGCTTC GAATCGTCGC GCATCGGCTA CGCATCCACG
CTCGTCGCGC TTGCCGTGGA ACCCGGCGAC CTCGACCGGG TGGCCGCGCT CGTGGGCGCG
CATCCGGGCA TCACGCACAA CTACGAGCGC GACGACCGCT ACAACCTGTG GTTCACGCTC
ATCGCGCGGG GCGCGGAGGC GCGCGACGCG GAGCTTGCGC GCATCGTGGC GCAGACGGGA
TGCGACGACG TGCTGGCGCT GCCGGCCATC CGCCTGTTCA AGATAAAGGT CGCCTTCGAC
GTGCGCGAAG GAGCTGGAAC CGACGAGGCG TCCGCCGCTG CTCCTTCGTC GCTTCGCGCT
CCCCTCGAGC CGGCGCGCGT CGTCGCAGAG CCGCTCGACG ATGCCGATCG GGCGCTCGTG
CGCGCGCTGC AAGGCGATCT GGGCGGCACG CTGCGCCCCT TCGCGCGCGC GGCCGAGGTC
GCTTCCTCGT ATGCGGGCAC TGCGTTGAAC GAGCGATGGG CCTCCGATCG CACGCGCGCG
TTGCTGGAAG CCGGGGCTGT CCGGCGCTTC GGCGCGATGG TCCGGCACCG CCGCATGGGG
TTCTCGAGCA ACGCCATGGG GGTGTGGAAC GTGCCGGACG AGCAAGTGCT GGCCGCGGGC
ACCGTTTTGG CGGCCCCGGC CGAGGTGAGC CATTGCTATG AGCGGCCGCG TTCGCAGACG
TGGCCCTACA ACTTGTACAC GATGATCCAC GGCCGCGACC GCGACGCCTG CGAGCGGACG
GCCGCCCGGC TCCACGACGA CTTGTCGAGG GCCGGCATCG ACGTGCTGCC CGCGCGCCTG
CTGCTGTCCA CTCGGGAGTT CAAGAAGACG TCCATGCGCT ACTTCGAGGA GGAACGATGA
 
Protein sequence
MAAEARGTLG VHLDSPVARR VLTRVQQELP VCERPYAALG DACGTSEEQA FAAVEAARTA 
DIVRRIGASF ESSRIGYAST LVALAVEPGD LDRVAALVGA HPGITHNYER DDRYNLWFTL
IARGAEARDA ELARIVAQTG CDDVLALPAI RLFKIKVAFD VREGAGTDEA SAAAPSSLRA
PLEPARVVAE PLDDADRALV RALQGDLGGT LRPFARAAEV ASSYAGTALN ERWASDRTRA
LLEAGAVRRF GAMVRHRRMG FSSNAMGVWN VPDEQVLAAG TVLAAPAEVS HCYERPRSQT
WPYNLYTMIH GRDRDACERT AARLHDDLSR AGIDVLPARL LLSTREFKKT SMRYFEEER