Gene EcolC_2668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2668 
Symbol 
ID6066780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2924649 
End bp2925839 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID641602074 
Productaromatic amino acid aminotransferase 
Protein accessionYP_001725624 
Protein GI170020670 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0040813 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.145428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT 
CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG
GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATCTGCT CGAAAATGAA
ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA
CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT
CCGGGGGGCA CTGGCGCACT ACGCGTAGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT
AAGCGAGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA
GGTCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ATCACACTCT TGACTTCGAT
GCACTGATTA ACAGCCTGAA TGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC
TGCCATAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAGACACT GGCACAACTC
TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT
CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCGGCTA TGCATAAAGA GCTGATTGTT
GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG
GTTGCTGCCG ACAGTGAAAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC
GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC
GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT
ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGCG CAAACCGCGA CTTCAGCTTT
ATCATCAAAC AGAACGGCAT GTTCTCCTTC AGTGGCCTGA CAAAAGAACA AGTGCTGCGT
CTGCGCGAAG AGTTTGGCGT ATATGCGGTT GCTTCTGGTC GCGTAAATGT GGCCGGGATG
ACACCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CAGTGCTGTA A
 
Protein sequence
MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE 
TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRVA ADFLAKNTSV
KRVWVSNPSW PNHKSVFNSA GLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC
CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV
ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN
DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR
LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL