Gene Arth_3860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3860 
Symbol 
ID4447559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4343010 
End bp4344218 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content67% 
IMG OID639691684 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_833335 
Protein GI116672402 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCTTCA ATCCCGACGC CGCTGGCTGG AGCCCTGACA CCCAGGCTGT CCGCGGCGGA 
CTTGACCGCA CCAATTTCCA GGAGACCGCC GAGCCGATCT TCCTCAACTC GGGGTTCGTC
TACGAATCCG CGGCAGCTGC CGAGCGGGCA TTCACCGGCG AGGACGAACG GTTTGTGTAC
TCCCGGTACG GCAACCCCTC CGTGGCCACC TTCCAGGAAC GCCTCCGGCT GCTCGAGGGC
ACCGAAGCGT GCTTTGCGAC GGCGTCCGGC ATGTCCGCCG TCTTTACGGC ACTGGGTGCC
CTGCTGGCTG CCGGTGACCG GGTGGTTGCC GCCCGGTCGC TGTTCGGCTC CTGCTTTGTC
ATTTTGAACG AGATCCTGCC CCGGTGGGGC GTCGAAACGG TGTTCGTGGA CGGCCCGGAC
CTGGAACAGT GGCGGACCGC TTTGGCGGAA CCGACGACGG CGGTCTTCTT CGAGTCGCCG
TCGAACCCGA TGCAGGAGAT CGTGGACATC GCCGCGGTCA GCGAACTGGC CCACGCTGCC
GGGGCGACCG TCGTCGTCGA CAATGTCTTT GCCACCCCCC TGCTGCAGCG CTGCGGGGAG
CTCGGCGCGG ATGTGGTGGT GTACTCCGGC ACCAAGCACA TTGACGGCCA GGGGCGGGTC
CTTGGCGGCG CCATCCTGGG CACCAAGGAG TTCATCGAAG GCCCGGTCAA GCAGCTGATG
CGCCACACCG GGCCGGCGCT TTCAGCCTTC AACGCCTGGG TGCTGACGAA GGGCCTGGAA
ACCATGGCGC TGCGCGTCAA CCATTCCTCG GCCTCCGCGC TTCGCCTGGC CGAGTGGCTG
GAAGGCCAGC CGGCGGTCAG CTGGGTCAAG TACCCGCTGC TGAAGTCCCA CCCGCAGTTT
GAGCTGGCGG CCAGGCAGAT GAAGGCCGGC GGTACCGTGC TCACGCTGGA GCTTCTGCCG
TCGGCGGGCC GCACGGCGAA GGAAGCAGCC TTTGCCCTGC TGGACGCGCT GCGGATCATC
GACATCTCCA ACAACCTCGG CGATGCCAAG ACGCTCATCA CCCACCCGGC CACCACCACG
CACCGTGCCA TGGGGCCGGA TGGCCGGGCC GCCATCGGGT TGAGCGACGG CGTGGTCCGC
CTGTCGGTAG GACTCGAGGA TGTTGACGAC CTCATCCGCG ACCTGGAGCA GGCGCTCAAA
CAGATCTGA
 
Protein sequence
MTFNPDAAGW SPDTQAVRGG LDRTNFQETA EPIFLNSGFV YESAAAAERA FTGEDERFVY 
SRYGNPSVAT FQERLRLLEG TEACFATASG MSAVFTALGA LLAAGDRVVA ARSLFGSCFV
ILNEILPRWG VETVFVDGPD LEQWRTALAE PTTAVFFESP SNPMQEIVDI AAVSELAHAA
GATVVVDNVF ATPLLQRCGE LGADVVVYSG TKHIDGQGRV LGGAILGTKE FIEGPVKQLM
RHTGPALSAF NAWVLTKGLE TMALRVNHSS ASALRLAEWL EGQPAVSWVK YPLLKSHPQF
ELAARQMKAG GTVLTLELLP SAGRTAKEAA FALLDALRII DISNNLGDAK TLITHPATTT
HRAMGPDGRA AIGLSDGVVR LSVGLEDVDD LIRDLEQALK QI