Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1949 |
Symbol | |
ID | 8416256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2286235 |
End bp | 2287452 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645024922 |
Product | arginine deiminase |
Protein accession | YP_003182302 |
Protein GI | 257791696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2235] Arginine deiminase |
TIGRFAM ID | [TIGR01078] arginine deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000474348 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.297072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGTT TGAACGTCAA GAGCGAGATC AAGCCCTTGA AAAAAGTTCT TCTCCACCGC CCTGGTCGAG AGCTTCTGAA CCTGACGCCG AACACGCTCG AAGAGCTGCT GTTCGACGAC ATCCCGTTTC TGAAGGTCGC TCAGGAGGAG CACGACGCTT TCGCGCAGAT TCTGCGCGAC AACGGCGTGG AGGTCGTGTA CCTCGAGAAG CTCATGGCCG AGGTCCTCGA TCAGAAACCC GAACTGCGCG AGAAGTTCCT CAAGCAGTGG ATCGAAGAGG CCGGTATCCG CACCGACCGC TACCAGAAGA TCATCTTCGA CTATATGCAG GAGAACTACC CCGATAACCT CGACCTGGTC ATGAAGACGA TGGAGGGCAT CAACCTCACC GAGCTTCACA CCGACAAGTC GAACTCCCTG GTCGATCTCG TCAGCGAGTC CTCCAAGATG GTCATCGCCC CCATGCCGAA CCTGTACTTC ACCCGCGATC CGTTCGCGTC CATCGGCAAC GGCGTGTCCA TCAACCGCAT GTACTCCGTC ACGCGCAACC GCGAGACGAT CTACGCCGAG TACATCTTCG GAAACCATCC GGACTTCGCG GATGTTCCCG AGTACTACAG CCGCTACAAC ACGTTCCACA TCGAGGGCGG CGACATCCTC AACATCAACG ACAAGGTGCT GGCCATCGGC ATTTCCCAGC GCACCGAGCC CGACGCCATC GACGCCATCG CGAAGAACAT CTTCGAGGAC GAGACCAGCC CGGTCGAGAC CATCCTGGCG TTCAACATCC CGAACAACCG CGCGATGATG CACCTTGACA CGGTGTTCAC CCAGATCGAC GTCGACAAGT TCACCATCCA TCCCGGCATC ATGGGCCCGC TGACCGTCTT CGAGATCACC GCCGAGGGCG ACGGTATCAA GGTCAAGGAG ATGAGCGGCA AGCTCGAGGA CATCCTCGAG AAGTACGTCG GCAACCCCGT GGAGCTCATT CCCTGCGGCG GCGGCGACCG CATCGCGGCC GAGCGCGAGC AGTGGAACGA CGGCTCGAAC ACGCTGTGCA TCGCGCCGGG CACCATCGTG GTGTACGAGC GCAACGACGT GACGAACGCG CTGCTCAAGG AGAAGGGCCT CAAGGTTCTC GAGATGCCCT CCGCCGAGCT GTCTCGCGGC CGTGGCGGCC CGCGCTGCAT GAGCATGCCG CTTGTGCGCG AGGACTAA
|
Protein sequence | MAGLNVKSEI KPLKKVLLHR PGRELLNLTP NTLEELLFDD IPFLKVAQEE HDAFAQILRD NGVEVVYLEK LMAEVLDQKP ELREKFLKQW IEEAGIRTDR YQKIIFDYMQ ENYPDNLDLV MKTMEGINLT ELHTDKSNSL VDLVSESSKM VIAPMPNLYF TRDPFASIGN GVSINRMYSV TRNRETIYAE YIFGNHPDFA DVPEYYSRYN TFHIEGGDIL NINDKVLAIG ISQRTEPDAI DAIAKNIFED ETSPVETILA FNIPNNRAMM HLDTVFTQID VDKFTIHPGI MGPLTVFEIT AEGDGIKVKE MSGKLEDILE KYVGNPVELI PCGGGDRIAA EREQWNDGSN TLCIAPGTIV VYERNDVTNA LLKEKGLKVL EMPSAELSRG RGGPRCMSMP LVRED
|
| |