Gene EcSMS35_0621 details

Gene Information

Locus tag: EcSMS35_0621
Symbol:
ID: 6143367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 634440
End bp: 635600
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 54%
IMG OID: 641615513
Product: putative aminotransferase
Protein accession: YP_001742719
Protein GI: 170683545
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
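
The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical, chosen only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(634440, 635600)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Both results agree with the Gene Length and Protein Length fields of this record.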


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAATA ACCCTCTGAT TCCACAAAGC AAACTTCCAC AACTTGGCAC CACTATTTTC 
ACCCAGATGA GTGCACTGGC GCAGCAACAC CAGGCGATTA ACCTGTCGCA AGGCTTTCCT
GATTTTGATG GTCCGCGCTA TTTGCAGGAA CGGCTGGCGT ACCACGTTGC CCAGGGGGCA
AACCAATACG CACCCATGAC CGGCGTGCAG GCATTACGCG AGGCGATTGC TCAGAAAACG
GAACGGCTGT ATGGCTATCA ACCAGATGCC GACAGCGATA TCACCGTAAC AGCAGGGGCG
ACGGAAGCGC TATACGCAGC GATTACTGCA TTGGTTCGCA ATGGCGATGA AGTGATTTGC
TTTGATCCCA GCTATGACAG TTACGCGCCC GCCATCGCGC TTTCGGGGGG AATAGTGAAA
CGTATTGCAC TGCAACCACC GCATTTTCGC GTGGACTGGC AGGAATTTGC CGCATTGTTA
AGCGAGCGCA CCCGACTGGT GATCCTCAAC ACCCCGCATA ACCCCAGTGC AACTGTCTGG
CAGCAGGCTG ATTTCTCCGC TCTGTGGCAG GCGATTGCCG GGCACGAGAT TTTTGTCATT
AGCGATGAAG TCTACGAGCA CATCAATTTT TCACAACAGG GCCATGCCAG CGTGCTGGCG
CATCCGCAAT TGTGTGAGCG AGCGGTGGCG GTGTCATCGT TTGGCAAGAC CTATCATATG
ACCGGCTGGA AAGTGGGGTA TTGCGTTGCA CCTGCGCCCA TCAGCGCCGA GATCCGCAAA
GTGCATCAGT ATCTGACCTT TTCGGTGAAT ACCCCGGCAC AGTTGGCGCT TGCCGATATG
CTACGTGCAG AACCTGAGCA TTATCTTGCG TTACCGGACT TTTATCGCCA GAAGCGCGAT
ATTCTGGTGA ATGCCTTAAA TGAAAGTCGG CTGGAGATTT TACCGTGCGA AGGTACATAC
TTTTTGCTGG TGGATTACAG CGCCGTATCT ACGCTGGATG ATGTTGAGTT TTGCCAGTGG
CTGACGCGGG AGCACGGCGT AGCGGCGATT CCGCTGTCGG TGTTTTGCGC CGATCCCTTC
CCACATAAAC TGATTCGTCT CTGTTTTGCC AAGAAGGAAT CGACGTTGCT GGCAGCAGCA
GAACGCCTGC GCCAGCTTTA G
 
Protein sequence
MTNNPLIPQS KLPQLGTTIF TQMSALAQQH QAINLSQGFP DFDGPRYLQE RLAYHVAQGA 
NQYAPMTGVQ ALREAIAQKT ERLYGYQPDA DSDITVTAGA TEALYAAITA LVRNGDEVIC
FDPSYDSYAP AIALSGGIVK RIALQPPHFR VDWQEFAALL SERTRLVILN TPHNPSATVW
QQADFSALWQ AIAGHEIFVI SDEVYEHINF SQQGHASVLA HPQLCERAVA VSSFGKTYHM
TGWKVGYCVA PAPISAEIRK VHQYLTFSVN TPAQLALADM LRAEPEHYLA LPDFYRQKRD
ILVNALNESR LEILPCEGTY FLLVDYSAVS TLDDVEFCQW LTREHGVAAI PLSVFCADPF
PHKLIRLCFA KKESTLLAAA ERLRQL
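
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own method: it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional TCAG ordering used by NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACAAATA ACCCTCTGAT TCCACAAAGC"))  # MTNNPLIPQS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.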