Gene EcHS_A3644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3644 
Symbolggt 
ID5594571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3631339 
End bp3633081 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content55% 
IMG OID640922761 
Productgamma-glutamyltranspeptidase 
Protein accessionYP_001460241 
Protein GI157162923 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0405] Gamma-glutamyltransferase 
TIGRFAM ID[TIGR00066] gamma-glutamyltranspeptidase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAC CGACGTTTTT ACGCCGGGTG GCCATTGCTG CTCTGCTCTC AGGAAGTTGT 
TTTAGCGCCG CCGCCGCGCC TCCTGCGCCG CCCGTCTCGT ATGGTGTGGA GGAAGATGTC
TTCCACCCGG TACGCGCGAA ACAGGGAATG GTAGCGTCTG TGGACGCCAC TGCCACTCAG
GTGGGGGTGG ATATTCTCAA GGAGGGCGGG AATGCCGTTG ATGCCGCCGT GGCGGTGGGC
TACGCGCTGG CGGTAACGCA TCCGCAGGCA GGGAATCTGG GCGGTGGTGG TTTTATGTTA
ATCCGCTCGA AAAATGGCAA TACCACGGCT ATCGATTTCC GCGAAATGGC ACCCGCCAAA
GCGACCCGCG ATATGTTCCT CGATGATCAG GGCAACCCGG ACAGCAAAAA ATCACTCACT
TCGCATCTGG CTTCCGGCAC ACCGGGTACG GTAGCAGGTT TCTCGCTGGC GCTGGATAAA
TACGGCACCA TGCCGCTGAA CAAAGTCGTG CAGCCCGCGT TTAAACTGGC ACGCGATGGT
TTTATCGTTA ACGACGCGCT GGCTGACGAT CTCAAAACCT ACGGTAGCGA AGTGTTGCCG
AATCACGAAA ACAGTAAAGC TATCTTCTGG AAAGAGGGCG AGCCGCTGAA AAAGGGCGAC
ACGCTGGTGC AGGCGAACCT GGCAAAGAGC CTGGAGATGA TTGCTGAAAA CGGCCCGGAC
GAATTCTATA AAGGCACGAT TGCGGAACAG ATCGCCCAGG AGATGCAGAA AAACGGTGGC
TTGATCACTA AAGAAGATTT AGCGGCCTAT AAAGCGGTCG AACGCACTCC GATAAGCGGC
GATTATCGCG GGTATCAGGT TTACTCCATG CCACCGCCAT CCTCCGGCGG GATCCATATC
GTACAAATCC TCAATATTCT GGAAAACTTC GATATGAAGA AATACGGCTT TGGCAGCGCC
GATGCGATGC AAATCATGGC AGAAGCGGAG AAATACGCCT ACGCCGACCG CTCGGAATAT
CTTGGCGACC CGGATTTTGT CAAAGTCCCG TGGCAGGCAC TGACCAATAA AGCCTATGCC
AAATCCATTG CCGATCAAAT TGATATCAAC AAAGCGAAGC CGTCCAGTGA GATTCGCCCT
GGCAAGCTTG CGCCTTATGA GAGTAATCAA ACTACCCATT ACTCTGTAGT GGATAAAGAC
GGTAACGCGG TGGCGGTGAC CTATACGCTG AACACCACCT TCGGTACGGG TATTGTCGCG
GGCGAGAGCG GTATTCTGCT TAATAACCAG ATGGATGATT TCTCCGCCAA ACCGGGCGTA
CCGAACGTTT ACGGGCTGGT GGGCGGTGAT GCCAACGCCG TCGGGCCGAA CAAACGCCCG
CTGTCGTCGA TGTCGCCGAC CATTGTGGTG AAAGACGGTA AAACCTGGCT GGTTACCGGT
AGCCCAGGCG GTAGCCGGAT CATCACTACA GTGCTGCAAA TGGTGGTGAA TAGCATCGAT
TATGGCATGA ACGTCGCCGA AGCGACCAAT GCGCCGCGTT TCCACCATCA GTGGTTGCCG
GACGAGCTGC GTGTCGAAAA AGGGTTTAGC CCGGATACGC TCAAGCTGCT GGAAGCAAAA
GGTCAGAAAG TGGCGCTGAA AGAGGCGATG GGCAGTACAC AAAGCATTAT GGTTGGGCCG
GACGGTGAGT TGTACGGCGC ATCCGACCCG CGCTCGGTGG ATGATTTAAC GGCGGGGTAC
TAA
 
Protein sequence
MIKPTFLRRV AIAALLSGSC FSAAAAPPAP PVSYGVEEDV FHPVRAKQGM VASVDATATQ 
VGVDILKEGG NAVDAAVAVG YALAVTHPQA GNLGGGGFML IRSKNGNTTA IDFREMAPAK
ATRDMFLDDQ GNPDSKKSLT SHLASGTPGT VAGFSLALDK YGTMPLNKVV QPAFKLARDG
FIVNDALADD LKTYGSEVLP NHENSKAIFW KEGEPLKKGD TLVQANLAKS LEMIAENGPD
EFYKGTIAEQ IAQEMQKNGG LITKEDLAAY KAVERTPISG DYRGYQVYSM PPPSSGGIHI
VQILNILENF DMKKYGFGSA DAMQIMAEAE KYAYADRSEY LGDPDFVKVP WQALTNKAYA
KSIADQIDIN KAKPSSEIRP GKLAPYESNQ TTHYSVVDKD GNAVAVTYTL NTTFGTGIVA
GESGILLNNQ MDDFSAKPGV PNVYGLVGGD ANAVGPNKRP LSSMSPTIVV KDGKTWLVTG
SPGGSRIITT VLQMVVNSID YGMNVAEATN APRFHHQWLP DELRVEKGFS PDTLKLLEAK
GQKVALKEAM GSTQSIMVGP DGELYGASDP RSVDDLTAGY