Gene ECD_00476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00476 
SymbolcysS 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp523698 
End bp525083 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content55% 
IMG OID 
Productcysteinyl-tRNA synthetase 
Protein accessionACT42375 
Protein GI253976705 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0063567 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAA TCTTCAATAC TCTGACACGC CAAAAAGAGG AATTTAAGCC TATTCACGCC 
GGGGAAGTCG GCATGTACGT GTGTGGAATC ACCGTTTACG ATCTCTGTCA TATCGGTCAC
GGGCGTACCT TTGTTGCTTT TGACGTGGTT GCGCGCTATC TGCGTTTCCT CGGCTATAAG
CTGAAGTATG TGCGCAACAT TACCGATATC GACGACAAAA TCATCAAACG CGCCAATGAA
AATGGCGAAA GCTTTGTGGC GCTGGTGGAT CGCATGATCG CCGAAATGCA CAAAGATTTT
GATGCGTTGA ACATTCTGCG CCCGGATATG GAGCCGCGCG CGACGCACCA TATCGCAGAA
ATTATTGAAC TCACTGAACA ACTGATCGCC AAAGGTCACG CTTATGTGGC GGACAACGGC
GACGTGATGT TCGACGTCCC GACCGATCCA ACTTATGGCG TGCTGTCGCG TCAGGATCTC
GACCAGCTGC AGGCAGGCGC GCGCGTTGAC GTGGTCGACG ACAAACGCAA CCCAATGGAC
TTCGTTCTGT GGAAGATGTC GAAAGAGGGC GAACCGAGCT GGCCGTCTCC GTGGGGCGCG
GGTCGTCCTG GCTGGCACAT TGAATGTTCG GCAATGAACT GCAAGCAGCT GGGTAACCAC
TTTGATATCC ACGGCGGCGG TTCAGACCTG ATGTTCCCGC ACCACGAAAA CGAAATCGCG
CAGTCCACCT GTGCCCATGA TGGTCAGTAT GTGAACTACT GGATGCACTC GGGGATGGTG
ATGGTTGACC GCGAGAAGAT GTCCAAATCG CTGGGTAACT TCTTTACCGT GCGCGATGTG
CTGAAATACT ACGACGCGGA AACCGTGCGT TACTTCCTGA TGTCGGGCCA CTATCGCAGC
CAGCTGAACT ATAGCGAAGA GAACCTGAAG CAGGCGCGTG CGGCGCTGGA GCGTCTCTAC
ACTGCGCTGC GCGGCACAGA TAAAACCGTT GCGCCTGCCG GTGGCGAAGC GTTTGAAGCG
CGCTTTATTG AAGCGATGGA CGACGATTTC AACACCCCGG AAGCCTATTC CGTGCTGTTT
GATATGGCGC GTGAAGTAAA CCGTCTGAAA GCAGAAGATA TGGCAGCGGC GAATGCAATG
GCATCTCACC TGCGTAAACT TTCCGCCGTA TTGGGCCTGC TGGAGCAAGA ACCGGAAGCG
TTCCTGCAAA GCGGCGCGCA GGCAGACGAC AGCGAAGTGG CTGAGATTGA AGCGTTAATT
CAACAGCGTC TGGATGCCCG TAAAGCGAAA GACTGGGCGG CGGCAGATGC GGCGCGTGAC
CGTCTTAATG AGATGGGGAT CGTGCTGGAA GATGGCCCGC AAGGGACCAC CTGGCGTCGT
AAGTAA
 
Protein sequence
MLKIFNTLTR QKEEFKPIHA GEVGMYVCGI TVYDLCHIGH GRTFVAFDVV ARYLRFLGYK 
LKYVRNITDI DDKIIKRANE NGESFVALVD RMIAEMHKDF DALNILRPDM EPRATHHIAE
IIELTEQLIA KGHAYVADNG DVMFDVPTDP TYGVLSRQDL DQLQAGARVD VVDDKRNPMD
FVLWKMSKEG EPSWPSPWGA GRPGWHIECS AMNCKQLGNH FDIHGGGSDL MFPHHENEIA
QSTCAHDGQY VNYWMHSGMV MVDREKMSKS LGNFFTVRDV LKYYDAETVR YFLMSGHYRS
QLNYSEENLK QARAALERLY TALRGTDKTV APAGGEAFEA RFIEAMDDDF NTPEAYSVLF
DMAREVNRLK AEDMAAANAM ASHLRKLSAV LGLLEQEPEA FLQSGAQADD SEVAEIEALI
QQRLDARKAK DWAAADAARD RLNEMGIVLE DGPQGTTWRR K