Gene Elen_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1949 
Symbol 
ID8416256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2286235 
End bp2287452 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content60% 
IMG OID645024922 
Productarginine deiminase 
Protein accessionYP_003182302 
Protein GI257791696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000474348 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.297072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGTT TGAACGTCAA GAGCGAGATC AAGCCCTTGA AAAAAGTTCT TCTCCACCGC 
CCTGGTCGAG AGCTTCTGAA CCTGACGCCG AACACGCTCG AAGAGCTGCT GTTCGACGAC
ATCCCGTTTC TGAAGGTCGC TCAGGAGGAG CACGACGCTT TCGCGCAGAT TCTGCGCGAC
AACGGCGTGG AGGTCGTGTA CCTCGAGAAG CTCATGGCCG AGGTCCTCGA TCAGAAACCC
GAACTGCGCG AGAAGTTCCT CAAGCAGTGG ATCGAAGAGG CCGGTATCCG CACCGACCGC
TACCAGAAGA TCATCTTCGA CTATATGCAG GAGAACTACC CCGATAACCT CGACCTGGTC
ATGAAGACGA TGGAGGGCAT CAACCTCACC GAGCTTCACA CCGACAAGTC GAACTCCCTG
GTCGATCTCG TCAGCGAGTC CTCCAAGATG GTCATCGCCC CCATGCCGAA CCTGTACTTC
ACCCGCGATC CGTTCGCGTC CATCGGCAAC GGCGTGTCCA TCAACCGCAT GTACTCCGTC
ACGCGCAACC GCGAGACGAT CTACGCCGAG TACATCTTCG GAAACCATCC GGACTTCGCG
GATGTTCCCG AGTACTACAG CCGCTACAAC ACGTTCCACA TCGAGGGCGG CGACATCCTC
AACATCAACG ACAAGGTGCT GGCCATCGGC ATTTCCCAGC GCACCGAGCC CGACGCCATC
GACGCCATCG CGAAGAACAT CTTCGAGGAC GAGACCAGCC CGGTCGAGAC CATCCTGGCG
TTCAACATCC CGAACAACCG CGCGATGATG CACCTTGACA CGGTGTTCAC CCAGATCGAC
GTCGACAAGT TCACCATCCA TCCCGGCATC ATGGGCCCGC TGACCGTCTT CGAGATCACC
GCCGAGGGCG ACGGTATCAA GGTCAAGGAG ATGAGCGGCA AGCTCGAGGA CATCCTCGAG
AAGTACGTCG GCAACCCCGT GGAGCTCATT CCCTGCGGCG GCGGCGACCG CATCGCGGCC
GAGCGCGAGC AGTGGAACGA CGGCTCGAAC ACGCTGTGCA TCGCGCCGGG CACCATCGTG
GTGTACGAGC GCAACGACGT GACGAACGCG CTGCTCAAGG AGAAGGGCCT CAAGGTTCTC
GAGATGCCCT CCGCCGAGCT GTCTCGCGGC CGTGGCGGCC CGCGCTGCAT GAGCATGCCG
CTTGTGCGCG AGGACTAA
 
Protein sequence
MAGLNVKSEI KPLKKVLLHR PGRELLNLTP NTLEELLFDD IPFLKVAQEE HDAFAQILRD 
NGVEVVYLEK LMAEVLDQKP ELREKFLKQW IEEAGIRTDR YQKIIFDYMQ ENYPDNLDLV
MKTMEGINLT ELHTDKSNSL VDLVSESSKM VIAPMPNLYF TRDPFASIGN GVSINRMYSV
TRNRETIYAE YIFGNHPDFA DVPEYYSRYN TFHIEGGDIL NINDKVLAIG ISQRTEPDAI
DAIAKNIFED ETSPVETILA FNIPNNRAMM HLDTVFTQID VDKFTIHPGI MGPLTVFEIT
AEGDGIKVKE MSGKLEDILE KYVGNPVELI PCGGGDRIAA EREQWNDGSN TLCIAPGTIV
VYERNDVTNA LLKEKGLKVL EMPSAELSRG RGGPRCMSMP LVRED