Gene TM1040_1439 details


Gene Information

Locus tag: TM1040_1439
Symbol:
ID: 4078069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1536892
End bp: 1538076
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 60%
IMG OID: 638006750
Product: cysteine synthase A
Protein accession: YP_613434
Protein GI: 99081280
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
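The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1536892, 1538076)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both results agree with the Gene Length and Protein Length fields of the record.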


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.159205
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.177571
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTTGC GGGCAAGTCA GACGCAGTGG AACCACGGTC GCGCCGAAAT CGTCCCCGAC 
AGGCGAAACG CCAGCCTGCG CGCGCTCTGG ACCCTTGTGA TCGCGCCGGT GCCCAGCTAC
ATCAGAGGGG AACGAGCGAA AGGGAATTTC ATGCGCATTG CACGCGATCT GGCCGATGCG
GTCGGCCATA CCCCTCTGAT CCGACTGAAC AAAGTCAGCG AAGAAACCGG TTGCGAAATT
CTTGGCAAGG CGGAGTTTCT GAACCCCGGC CAATCCGTGA AGGACCGCGC CGCGCTCTAC
ATCATCAAGG ACGCTATCGC CCGCGGGGAA CTCAAACCCG GCGGCACGAT TGTAGAAGGT
ACAGCGGGCA ATACCGGCAT TGGGCTTGCG CTGGTCGGCG CCTCCATGGG CTTCAAGACC
GTGATCGTGA TCCCTGAAAC CCAGTCCGAA GAAAAGAAGG ACATGCTGCG TCTGGCGGGG
GCACAGTTGG TTCAGGTGCC TGCAGCGCCT TACAAGAACC CCAACAACTA TGTGCGCTAC
TCCGAGCGCC TGGCAAACGA GCTGGCCAAG ACAGAGCCGA ATGGCGCCAT CTGGGCCAAT
CAGTTTGACA ATGTGGCCAA CCGTCAGGCC CATGTGGAAA CCACCGGTCC CGAGATCTGG
GAGCAGACGG GCGGCAAGGT GGATGGTTTC ATCTGTGCTG TTGGCTCCGG CGGGACGCTT
GCCGGGATTG CCGAGGCGCT GCAGCCCAAG GGCGTGAAGA TCGGCCTTGC CGATCCCAAT
GGCGCCGCGC TTTATTCCTA CTACACCACT GGCGAGCTGA AGTCCGAGGG CTCCTCGATC
ACAGAAGGGA TCGGTCAGGG CCGGATCACC AAGAACCTCG AGGGGCTCAC CCCAGATTTC
AACTACCAGA TTCCGGACGC AGAGGCGCTG CCTTATGTGT TTGACCTTCT GCACGAAGAG
GGGCTGGTGC TTGGCGGATC TTCCGGCGTG AACATTGCCG GCGCAGTGCG TCTGGCCAAG
GAACTGGGGC CGGGGCACAC CATCGTTACC ATCCTGTGCG ACTATGGAAC GCGCTATCAG
TCCAAGCTCT TTAATCCGGA ATTCCTCAAA GACAAGGGCT TGCCGGTCCC CGACTGGATG
ACCCATACTC CTGCTTCAAT TCCCGGAGTA TTCGAGGACA TCTGA
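
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of this record. It assumes GC content means (G + C) / total bases; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases, assuming GC content = (G + C) / length."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed above.
first_line = "ATGGTTTTGC GGGCAAGTCA GACGCAGTGG AACCACGGTC GCGCCGAAAT CGTCCCCGAC"
print(len(clean_sequence(first_line)))  # 60 bases per full line
print(round(gc_fraction(first_line), 3))
```

Passing the full 20-line sequence to `gc_fraction` gives the value to compare against the GC content field.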
 
Protein sequence
MVLRASQTQW NHGRAEIVPD RRNASLRALW TLVIAPVPSY IRGERAKGNF MRIARDLADA 
VGHTPLIRLN KVSEETGCEI LGKAEFLNPG QSVKDRAALY IIKDAIARGE LKPGGTIVEG
TAGNTGIGLA LVGASMGFKT VIVIPETQSE EKKDMLRLAG AQLVQVPAAP YKNPNNYVRY
SERLANELAK TEPNGAIWAN QFDNVANRQA HVETTGPEIW EQTGGKVDGF ICAVGSGGTL
AGIAEALQPK GVKIGLADPN GAALYSYYTT GELKSEGSSI TEGIGQGRIT KNLEGLTPDF
NYQIPDAEAL PYVFDLLHEE GLVLGGSSGV NIAGAVRLAK ELGPGHTIVT ILCDYGTRYQ
SKLFNPEFLK DKGLPVPDWM THTPASIPGV FEDI
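
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the usual reading of translation table 11 (the bacterial, archaeal and plant plastid code), whose codon-to-amino-acid assignments match the standard code and which differs mainly in the start codons it permits. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The function name is hypothetical.

```python
from itertools import product

# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGTTTTGC GGGCAAGTCA GACGCAGTGG"))  # MVLRASQTQW
```

The output matches the first ten residues of the protein sequence listed above.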