Gene Elen_0412 details


Gene Information

Locus tag: Elen_0412
Symbol:
ID: 8414696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 528646
End bp: 529887
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 61%
IMG OID: 645023387
Product: Agmatine deiminase
Protein accession: YP_003180790
Protein GI: 257790184
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2957] Peptidylarginine deiminase and related enzymes
TIGRFAM ID:
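
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(528646, 529887)
print(length, protein_length(length))  # 1242 413
```

Under these assumptions, 1242 bp corresponds to 414 codons, one of which is the stop codon, leaving the 413 residues reported for the protein.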


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.512784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.000333823
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTTGGAAA ACTATCGGCA TCCAGGGGAA TTCGAACCCC AATCAGACGT TTTCATGGAA 
TGGATTCCCG ACGCGTATCA GATGAAGGGC TACGACAACA GCCGGTCGTG CGCCGAGATC
GTCAAAGCGC TGCAGGAGTT CGGGGGCGTG ACGCTCCATA TCAACTGCGG TGCGGAAGGC
GTTCTCGAAC GTGCCAAGTC GAGCTTGGCG GAGAAGGGGG TCGATACGGC CGACATCCGC
TTCGTGCAAT TCGCCGATCC GAACTTCTAT GTGCGGGACA ACGGCCCGAC GGTTATGGTG
GACGATCGAG GCGGCAGAAT CCTGATCAAC CCGAATTGGA GCTACTACGG CACGCTGCCG
CCCGACGACC CGTACTGCGT GCAGTCGCGC ATCGCCGCCG TGCAGATGGG GGTGTCCTTG
GGCATCTTCG ACGTGGTGAA TTCCGATGTG GTGTCCGAAG GAGGGGATCG GGAGTTCAAC
GGTCAGGGCG TCATGATGTG CATCGAGGAC ACGGAAGTGC GCAAACGTAA TCCGGGTCTT
ACGAAAGAGC AGGTAGAAGC CGAATTCAAG AGACTCTACA ACGTGGAGAA GATCATCTGG
ATCCCACAGC CTTTGCTAGA AGACGACGAT TTCAGGATGG GGCCGTTGGA ATACCGCGAC
GGCGTGCCGT ACCTCGGCTC CAGCTTCGCG GCCCATATCG ACGAGCTGTG CCGCTTCGTG
GATGCGAACA CCGTCGTGCT TGCCGAGGTG ACCGACGATG AGGCGGCGGA AAGCGCGATC
GGCGCAGAGA ACAAACGACG CATCGAAGCC GCCTACGATG TGCTCTCGAA GGCGACGGAC
GTCCATGGCA ACCCGTTCGC CATCAAGCGC ATGCCCGTGC CTATCTCCAT CGATTACGTC
TTGACCGAGG ACGACGAGAA CTACGGGCTG TACGAGGGGC CCGTGATGGA GATGGGCGGC
GCCTTCGCCG ACGGCACGCC GTGGCCCGGC GGCCCCATCC ATCTCATAGC CTCGACGGGG
TACTGCAATT TCCTCATCTG CAACGGCGTG GTCATCGGCC AGCGCTACTG GCATGAGGGG
ATGGATCCGG CAATCAAGGG GAAGGACGAA GCCGCCCAAG CGGTTCTCGA GGAGTGCTTC
CCGGATCGCA CGGTGGTGAT GGTGGACAGC TTGGCGCTGA ACATGACCGG CGGCGGCGTG
CATTGCTGGA CGAAGAACGT TGCGGCGTCC GAGCCGCGAT GA
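
The GC content reported for this gene could be recomputed from the sequence block. The following is a minimal sketch, assuming GC content is simply the share of G and C bases among all bases and that the spaces and line breaks in the block are formatting only; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence: 9 of 20 bases are G or C.
print(round(gc_percent("GTGTTGGAAA ACTATCGGCA")))  # 45
```

Passing the full gene sequence instead of the two-group excerpt gives the value to compare with the reported figure.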
 
Protein sequence
MLENYRHPGE FEPQSDVFME WIPDAYQMKG YDNSRSCAEI VKALQEFGGV TLHINCGAEG 
VLERAKSSLA EKGVDTADIR FVQFADPNFY VRDNGPTVMV DDRGGRILIN PNWSYYGTLP
PDDPYCVQSR IAAVQMGVSL GIFDVVNSDV VSEGGDREFN GQGVMMCIED TEVRKRNPGL
TKEQVEAEFK RLYNVEKIIW IPQPLLEDDD FRMGPLEYRD GVPYLGSSFA AHIDELCRFV
DANTVVLAEV TDDEAAESAI GAENKRRIEA AYDVLSKATD VHGNPFAIKR MPVPISIDYV
LTEDDENYGL YEGPVMEMGG AFADGTPWPG GPIHLIASTG YCNFLICNGV VIGQRYWHEG
MDPAIKGKDE AAQAVLEECF PDRTVVMVDS LALNMTGGGV HCWTKNVAAS EPR
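
The protein sequence follows from the gene sequence by codon translation. The sketch below uses the standard codon assignments, which translation table 11 shares, and assumes that an alternative start codon such as GTG is read as methionine when it opens the coding sequence. The `START_CODONS` set lists only a subset of the table 11 start codons, and `translate` is an illustrative helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("GTGTTGGAAA ACTATCGGCA TCCAGGGGAA"))  # MLENYRHPGE
```

The output matches the first ten residues of the protein sequence, and translating the final twelve bases (`GAGCCGCGATGA`) with `is_cds_start=False` yields `EPR`, the last three residues before the TGA stop codon.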