Gene EcHS_A1828 details


Gene Information

Locus tag: EcHS_A1828
Symbol: astB
ID: 5591865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1844998
End bp: 1846341
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 55%
IMG OID: 640920972
Product: succinylarginine dihydrolase
Protein accession: YP_001458524
Protein GI: 157161206
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3724] Succinylarginine dihydrolase
TIGRFAM ID: [TIGR03241] succinylarginine dihydrolase
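
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1844998
END_BP = 1846341
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the span covers 1344 bp, and 1344 / 3 = 448 codons, one of which is the stop codon.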


Plasmid Coverage information

Num covering plasmid clones: 63
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCT GGGAAGTTAA TTTCGACGGG CTGGTAGGGC TGACGCATCA TTACGCTGGC 
CTGTCGTTTG GTAATGAAGC CTCTACCCGT CACCGTTTTC AGGTGTCTAA CCCGCGACAG
GCGGCAAAAC AGGGCTTACT GAAAATGAAA ACCCTTGCCG ATGCGGGATT CCCCCAGGCG
GTGATCCCGC CGCACGAGCG TCCGTTTATT CCGGTGCTGC GTCAGTTGGG ATTCAGTGGT
AGCGATGAGC AGGTACTGGA AAAGGTCGCA CGCCAGGCAC CGCACTGGCT TTCCAGCGTC
AGTTCCGCTT CGCCAATGTG GGTAGCCAAT GCGGCAACGA TCGCGCCATC TGCCGATACG
CTGGATGGCA AAGTGCATTT CACGGTTGCC AACCTGAACA ATAAATTTCA CCGTTCGCTG
GAAGCGCTCG TCACTGAATC GCTGTTAAAA GCGATTTTTA ACGACGAAGA GAAATTTAGC
GTCCATTCGG CGTTGCCACA GGTAGCGTTG CTCGGTGATG AGGGGGCGGC AAACCACAAT
CGTCTCGGCG GTCATTACGG TGAACCGGGT ATGCAACTTT TTGTCTACGG GCGAGAAGAA
GGCAATGATA CCCGGCCTTC CCGTTATCCG GCGCGACAGA CTCGCGAAGC CAGCGAGGCG
GTGGCAAGGC TGAATCAGGT GAATCCCCAA CAGGTGATTT TCGCCCAGCA AAACCCGGAC
GTTATCGACC AGGGCGTTTT TCATAATGAC GTGATTGCCG TGAGTAACCG CCAGGTGCTG
TTTTGCCACC AACAGGCGTT CGCTCGCCAG TCACAGTTAC TGGCAAACCT GCGTGCGCGG
GTCAATGGTT TTATGGCGAT AGAAGTTCCG GCAACTCAGG TTTCCGTGTC TGATACGGTG
TCTACCTATC TGTTTAACAG CCAACTGCTG AGCCGCGATG ATGGTTCCAT GATGTTGGTG
CTGCCTCAGG AGTGTCGGGA ACACGCCGGA GTATGGGGTT ATCTCAATGA ACTCCTTGCC
GCTGACAACC CGATTAGCGA ACTAAAAGTC TTTGATTTAC GTGAAAGCAT GGCGAATGGC
GGCGGCCCGG CGTGCCTGCG GTTGCGGGTG GTATTGACAG AAGAAGAACG CCGGGCGGTG
AATCCGGCGG TGATGATGAA CGATACGCTG TTTAATGCGC TCAATGACTG GGTGGATCGT
TACTACCGCG ATCGCCTTAC TGCTGCCGAT CTGGCCGACC CGCAATTGCT GCGCGAAGGG
CGGGAAGCAC TGGATGTATT GAGCCAATTA CTGAATCTCG GTTCGGTTTA TCCGTTCCAG
CGCGAGGGAG GGGGCAATGG ATAA
 
Protein sequence
MNAWEVNFDG LVGLTHHYAG LSFGNEASTR HRFQVSNPRQ AAKQGLLKMK TLADAGFPQA 
VIPPHERPFI PVLRQLGFSG SDEQVLEKVA RQAPHWLSSV SSASPMWVAN AATIAPSADT
LDGKVHFTVA NLNNKFHRSL EALVTESLLK AIFNDEEKFS VHSALPQVAL LGDEGAANHN
RLGGHYGEPG MQLFVYGREE GNDTRPSRYP ARQTREASEA VARLNQVNPQ QVIFAQQNPD
VIDQGVFHND VIAVSNRQVL FCHQQAFARQ SQLLANLRAR VNGFMAIEVP ATQVSVSDTV
STYLFNSQLL SRDDGSMMLV LPQECREHAG VWGYLNELLA ADNPISELKV FDLRESMANG
GGPACLRLRV VLTEEERRAV NPAVMMNDTL FNALNDWVDR YYRDRLTAAD LADPQLLREG
REALDVLSQL LNLGSVYPFQ REGGGNG
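
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the nucleotide sequence. It assumes that codons are read from the first base of the sequence as printed, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons). The function names are hypothetical, and only the first three and the last few blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; alternative start codons are not
# handled here because this gene begins with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First three blocks and the final bases of the gene sequence above.
GENE_START = "ATGAACGCCT GGGAAGTTAA TTTCGACGGG"
GENE_END = "GGCAATGG ATAA"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(GENE_START))  # MNAWEVNFDG
    print(translate(GENE_END))    # GNG
```

The two sample outputs match the first block (MNAWEVNFDG) and the last three residues (GNG) of the protein sequence above. Passing the full gene sequence to the same functions should work the same way.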