Gene Elen_1913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1913 
Symbol 
ID8416217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2243262 
End bp2244908 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content53% 
IMG OID645024883 
Productacylneuraminate cytidylyltransferase 
Protein accessionYP_003182266 
Protein GI257791660 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1083] CMP-N-acetylneuraminic acid synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000953718 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGTGG AGAGGAACGT GCTCGCTGTC ATTCCTGCTC GAGGAGGCTC GAAAGGGATA 
CCACGAAAGA ATATGCGCCT TATGCACGGC AGGCCGTTGA TTGACTATGC GATTTCATGT
GCAGCAGAGA GCAGCTGTAT CACCGATGTT GCCGTTAGCA CTGATTCGAG AGAGATACTG
GATTTCGTAG AACAGATCGA TGGCGTTGTT GCTCTTGAGC GCAGCCCGGA ACTTTCCGGA
GACGATATTA CGCTCGATCC GGTAGTGAAT GACGCAGTGG TTCGTGTGGA AGCGTTGCGG
GGTGTGCGAT ACGACATCGT CATCACGATG CAGCCTACCT CGCCTTTGCT AACAGTGCGC
ACTCTCGATA CGGCGATTGA GCGGTTTTTG AATTCCGATT CGGATACGAT GGTGAGTGCG
ACAAACGCTC CTCATTTGAG CTGGGGCAAA GACGGTCGGA ATCGTTTTGT TCCGAATTAC
GAACGACGTC TCAATCGACA GGAGCTTCCG CCGAACTATG TTGAAACCGG TGCATTTTTG
GTTTCGAAGC GTTCGTGTAT CGGGCCTTCG TCTCGCATAG GGGACAACGT TACGGTTTTT
GAGGTCCCTG CTGACGAAGC GGTCGACATC GACACGATGC AAGATTGGAT CGTGTGCGAA
GCCATGCTCT CTCGAAGGCT CATCGTGTTT CGGGTCGACG GGCATAAGGA GCTTGGCCTT
GGGCATGTGT ACCGTGCATT AACGCTTGGG TACGAACTGA TAGAGCATGA CGTCGTATTC
GTATGCAATG CACGTCATAG AGAAGGAATC GCCAAGTTGA AATCGGCGAA TATGCCGGTT
TTAGAGGTGG AGGACGACGA AGAGATGCTC CGATGGCTGG AGGAGCGCCG TCCCGATGTA
TTCGTATACG ATTGCCTTGA TAGCGATGCC AGTCTTATGG CTGATGTGAA GAGGTATGTT
AAGCGCTTGG TGACGTTCGA GGATTTGGGT GAAGGCGCAA GGTTGGCCGA TGCTGTTGTC
AACGCCATTT ACGAAGGGGC GTCTCCGCAT GGAAACGTGT ATTCGGGTAA GGGGTACGTC
TGCTTGCGGG ACGAGTTCCT TATAACGCAG CCTTCGGAAG ATTCGGATGA GGTGCGCCGG
ATTCTCGTCA CGTTTGGAGG AACGGATCCG CTCGATTTGA CGGCGCGGGT GTACGAACTG
GCAAAAAGGC ATAATGCAGA GGCAGTGGAC GTGACGTTTG ATTTCGTTCT CGGCTCGGGA
TACGACAATC CTGCGGTGCA AAGCGTTCCT GAGTGCGGGA TAGAGGTGAG TCGCAACGTG
TTGCGCATGA GCGACCATAT GCGTAAGGCC GACATGGCTC TTTCCTCCCA AGGGCGCACC
ACGTTCGAGC TTGCCTGCAT GGGCGTGCCT ACAATTGTTC TCGCGGAGAA CGAACGCGAG
CAGCTCCACA CGTTTGCTCA AATGGATAAC GGGTTCATCA ACCTGGGGCT CGGAAGCGAG
GTTTCCGACG AAGACCTCGC TTCAACAATA GCGTGGCTTG CGGGGGCGAG GTCCGTACGG
CGAGAGATGC GCAAGCTGAT GCTTGAGAAC GATTTGAGAT TAGGGATACG AAGAGTAAAG
AGGATCGTTT TGGGAGACGT ACTATGA
 
Protein sequence
MNVERNVLAV IPARGGSKGI PRKNMRLMHG RPLIDYAISC AAESSCITDV AVSTDSREIL 
DFVEQIDGVV ALERSPELSG DDITLDPVVN DAVVRVEALR GVRYDIVITM QPTSPLLTVR
TLDTAIERFL NSDSDTMVSA TNAPHLSWGK DGRNRFVPNY ERRLNRQELP PNYVETGAFL
VSKRSCIGPS SRIGDNVTVF EVPADEAVDI DTMQDWIVCE AMLSRRLIVF RVDGHKELGL
GHVYRALTLG YELIEHDVVF VCNARHREGI AKLKSANMPV LEVEDDEEML RWLEERRPDV
FVYDCLDSDA SLMADVKRYV KRLVTFEDLG EGARLADAVV NAIYEGASPH GNVYSGKGYV
CLRDEFLITQ PSEDSDEVRR ILVTFGGTDP LDLTARVYEL AKRHNAEAVD VTFDFVLGSG
YDNPAVQSVP ECGIEVSRNV LRMSDHMRKA DMALSSQGRT TFELACMGVP TIVLAENERE
QLHTFAQMDN GFINLGLGSE VSDEDLASTI AWLAGARSVR REMRKLMLEN DLRLGIRRVK
RIVLGDVL