Gene EcolC_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3866 
Symbol 
ID6066496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4221418 
End bp4222446 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content53% 
IMG OID641603281 
Productlysine 2,3-aminomutase YodO family protein 
Protein accessionYP_001726797 
Protein GI170021843 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1509] Lysine 2,3-aminomutase 
TIGRFAM ID[TIGR00238] KamA family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.358809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCATA TTGTAACCCT AAATACCCCA TCCAGAGAAG ATTGGTTAAC GCAACTTGCC 
GATGTTGTGA CCGATCCTGA TGAACTTCTG CGTCTTTTGA ATATAGACGC GGACGAAAAA
CTGTTGGCCG GACGCAGCGC CAAAAAGCTG TTTGCCCTGC GTGTGCCCCG CTCATTTATC
GATCGCATGG AGAAAGGCAA TCCGGAAGAT CCTCTTTTGC GTCAGGTACT TACCTCGCAA
GATGAGTTTG TCGTCGCGTC CGGATTCTCC ACCGACCCGC TGGAAGAACA GCACAGCGTA
GTGCCTGGTT TGTTGCATAA ATACCACAAC CGGGCGCTTT TGCTGGTCAA AGGCGGCTGC
GCGGTAAATT GCCGCTATTG CTTCCGTCGC CACTTCCCCT ATGCCGAAAA TCAGGGCAAC
AAGCGTAACT GGCAGACTGC TCTTGAGTAT GTTGCTGCGC ATCCGGAACT GGACGAGATG
ATTTTCTCCG GCGGCGATCC GCTGATGGCG AAAGATCACG AGCTGGACTG GTTGCTCACA
CAACTGGAAG CCATCCCGCA TATCAAACGT CTGCGGATTC ACAGCCGTCT GCCGATTGTG
ATCCCGGCAC GTATCACCGA GGCGCTGGTT GAACGCTTTG CCCGTTCTAC GCTGCAAATC
TTGCTGGTGA ATCACATCAA TCATGCCAAT GAGGTAGATG AAACGTTCCG TCAGGCGATG
GCTAAATTGC GCCGTGTCGG TGTCACCCTG CTTAACCAGA GCGTTCTGTT ACGTGGTGTG
AACGATAACG CACAAACGCT GGCAAACCTG AGTAATGCGT TGTTCGATGC CGGCGTAATG
CCGTATTACC TGCATGTGCT CGATAAAGTA CAGGGCGCGG CGCATTTTAT GGTGAGTGAT
GACGAAGCAC GGCAGATTAT GCGTGAGTTG CTGACACTGG TGTCGGGTTA TCTGGTGCCG
AAACTGGCGC GAGAAATCGG CGGCGAACCC AGCAAAACGC CGCTGGATCT CCAGCTACGC
CAGCAGTAA
 
Protein sequence
MAHIVTLNTP SREDWLTQLA DVVTDPDELL RLLNIDADEK LLAGRSAKKL FALRVPRSFI 
DRMEKGNPED PLLRQVLTSQ DEFVVASGFS TDPLEEQHSV VPGLLHKYHN RALLLVKGGC
AVNCRYCFRR HFPYAENQGN KRNWQTALEY VAAHPELDEM IFSGGDPLMA KDHELDWLLT
QLEAIPHIKR LRIHSRLPIV IPARITEALV ERFARSTLQI LLVNHINHAN EVDETFRQAM
AKLRRVGVTL LNQSVLLRGV NDNAQTLANL SNALFDAGVM PYYLHVLDKV QGAAHFMVSD
DEARQIMREL LTLVSGYLVP KLAREIGGEP SKTPLDLQLR QQ