Gene EcolC_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2008 
Symbol 
ID6068075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2214546 
End bp2215718 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content51% 
IMG OID641601422 
Productaminotransferase class I and II 
Protein accessionYP_001724981 
Protein GI170020027 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1168] Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00273969 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.938908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATT TTTCAAAGGT CGTGGATCGT CATGGCACAT GGTGTACACA GTGGGATTAT 
GTCGCTGACC GTTTCGGCAC TGCTGACCTG TTACCGTTCA CGATTTCAGA CATGGATTTT
GCCACTGCCC CCTGCATTAT CGAGGCGCTG AATCAGCGCC TGATGCACGG CGTATTTGGC
TACAGCCGCT GGAAAAACGA TGAGTTTCTC GCGGCTATTG CCCACTGGTT TTCCACCCAG
CATTACACCG CCATCGATTC TCAGACGGTG GTGTATGGCC CTTCTGTCAT CTATATGGTT
TCAGAACTGA TTCGTCAGTG GTCTGAAACA GGTGAAGGCG TGGTGATCCA CACACCCGCC
TATGACGCAT TTTACAAGGC CATTGAAGGT AACCAGCGCA CAGTAATGCC CGTTGCTTTA
GAGAAGCAGG CTGATGGTTG GTTTTGCGAT ATGGGCAAGT TGGAAGCCGT GTTGGCGAAA
CCAGAATGTA AAATTATGCT CCTGTGTAGC CCACAGAATC CTACCGGGAA AGTGTGGACG
TGCGATGAGC TGGAGATCAT GGCTGACCTG TGCGAGCGTC ATGGTGTGCG GGTTATTTCC
GATGAAATCC ATATGGATAT GGTTTGGGGC GAGCAGCCGC ATATTCCCTG GAGTAATGTG
GCTCGCGGAG ACTGGGCGTT GCTAACGTCG GGCTCGAAAA GTTTCAATAT TCCCGCCCTG
ACCGGTGCTT ACGGGATTAT AGAAAATAGC AGTAGCCGCG ATGCCTATTT ATCGGCACTG
AAAGGCCGTG ATGGGCTTTC TTCCCCTTCG GTACTGGCGT TAACTGCCCA TATCGCCGCC
TATCAGCAAG GCGCGCCGTG GCTGGATGCC TTACGCATCT ATCTGAAAGA TAACCTGACG
TATATCGCAG ATAAAATGAA CGCCGCGTTT CCTGAACTCA ACTGGCAGAT CCCACAATCC
ACTTATCTGG CATGGCTTGA TTTACGTCCG TTGAATATTG ACGACAACGC GTTGCAAAAA
GCACTTATCG AACAAGAAAA AGTCGCGATC ATGCCGGGGT ATACCTACGG TGAAGAAGGT
CGTGGTTTTG TCCGTCTCAA TGCCGGCTGC CCACGTTCGA AACTGGAAAA AGGTGTGGCT
GGATTAATTA ACGCCATCCG CGCTGTTCGT TAA
 
Protein sequence
MFDFSKVVDR HGTWCTQWDY VADRFGTADL LPFTISDMDF ATAPCIIEAL NQRLMHGVFG 
YSRWKNDEFL AAIAHWFSTQ HYTAIDSQTV VYGPSVIYMV SELIRQWSET GEGVVIHTPA
YDAFYKAIEG NQRTVMPVAL EKQADGWFCD MGKLEAVLAK PECKIMLLCS PQNPTGKVWT
CDELEIMADL CERHGVRVIS DEIHMDMVWG EQPHIPWSNV ARGDWALLTS GSKSFNIPAL
TGAYGIIENS SSRDAYLSAL KGRDGLSSPS VLALTAHIAA YQQGAPWLDA LRIYLKDNLT
YIADKMNAAF PELNWQIPQS TYLAWLDLRP LNIDDNALQK ALIEQEKVAI MPGYTYGEEG
RGFVRLNAGC PRSKLEKGVA GLINAIRAVR