Gene EcSMS35_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1446 
SymbolastB 
ID6143852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1430840 
End bp1432183 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content56% 
IMG OID641616324 
Productsuccinylarginine dihydrolase 
Protein accessionYP_001743504 
Protein GI170681484 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3724] Succinylarginine dihydrolase 
TIGRFAM ID[TIGR03241] succinylarginine dihydrolase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.555371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCT GGGAAGTCAA TTTCGACGGG CTGGTAGGGC TGACGCATCA TTACGCGGGC 
CTGTCGTTTG GCAATGAAGC CTCTACCCGT CACCGTTTTC AGATCTCTAA CCCGCGGCTG
GCGGCAAAAC AGGGCTTACT GAAAATGAAA AACCTTGCCG ATGCGGGATT CCCCCAGGCG
GTGATCCCGC CGCACGAGCG CCCGTTTATT CCGGTGCTGC GTCAGTTGGG TTTCAGTGGT
AGCGATGAGC AGGTACTGGA AAAGGTTGCA CGCCAGGCAC CGCACTGGCT TTCCAGCGTC
AGCTCCGCTT CGCCAATGTG GGTAGCCAAT GCGGCAACGA TCGCGCCATC TGCCGATACG
CTGGATGGCA AAGTGCATCT CACCATTGCT AACCTGAACA ATAAATTTCA CCGTTCGCTG
GAAGCCCCCG TCACCGAATC GCTGTTAAAA GCGATTTTTG ACGACGAAGA GAAATTTAGC
GTCCATTCGG CGTTGCCGCA GGTAGCCTTG CTCGGTGATG AGGGGGCGGC AAACCACAAT
CGTCTCGGCG GTCATTACGG TGAACCGGGT ATGCAACTTT TTGTCTACGG GCGAGAAGAG
GGCAATGATA CCCGGCCTTC CCGTTATCCG GCGCGACAGA CTCGCGAAGC CAGCGAGGCG
GTGGCAAGGC TGAATCAGGT GAATCCCCAA CAGGTGATTT TCGCCCAGCA AAACCCGGAC
GTTATCGACC AGGGCGTTTT TCATAATGAC GTGATTGCCG TGAGTAACCG CCAGGTGCTG
TTTTGCCATC AACAGGCGTT CGCTCGCCAG GTGCAGTTAC TGGCAAACCT GCGTGCGCGG
GTGAACGGTT TTATGGCGAT AGAAGTTCCG GCGACTCAGG TTTCCGTGTC AGATGCGGTG
TCTACATATC TGTTTAACAG CCAACTGCTG AGCCGCGATG ATGGTTCCAT GATGTTGGTG
CTGCCTCAGG AGTGTCGGGA ACACGCCGGA GTATGGGGTT ATCTCAATGA ACTCCTTGTC
GCTGACAACC CGATTAGCGA ACTAAAAGTC TTTGATTTAC GTGAAAGCAT GGCGAATGGC
GGAGGTCCGG CGTGCCTGCG GTTGCGCGTG GTATTGACAG AAGAAGAACG CCGGGCAGTG
AATCCGGCGG TGATGATGAA CGATACGCTG TTTAATGCGC TCAATGACTG GGTGGATCGT
TACTACCGCG ATCGCCTTAC TGCTGCCGAT CTGGCCGACC CGCAATTGCT GCGCGAAGGG
CGGGAAGCAC TGGATGTATT GAGCCAATTA CTGAATCTCG GTTCGGTTTA TCCGTTCCAG
CGCGAGGGAG GGGGCAATGG ATAA
 
Protein sequence
MNAWEVNFDG LVGLTHHYAG LSFGNEASTR HRFQISNPRL AAKQGLLKMK NLADAGFPQA 
VIPPHERPFI PVLRQLGFSG SDEQVLEKVA RQAPHWLSSV SSASPMWVAN AATIAPSADT
LDGKVHLTIA NLNNKFHRSL EAPVTESLLK AIFDDEEKFS VHSALPQVAL LGDEGAANHN
RLGGHYGEPG MQLFVYGREE GNDTRPSRYP ARQTREASEA VARLNQVNPQ QVIFAQQNPD
VIDQGVFHND VIAVSNRQVL FCHQQAFARQ VQLLANLRAR VNGFMAIEVP ATQVSVSDAV
STYLFNSQLL SRDDGSMMLV LPQECREHAG VWGYLNELLV ADNPISELKV FDLRESMANG
GGPACLRLRV VLTEEERRAV NPAVMMNDTL FNALNDWVDR YYRDRLTAAD LADPQLLREG
REALDVLSQL LNLGSVYPFQ REGGGNG