Gene Achl_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_0844 
Symbol 
ID7292281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp916666 
End bp917856 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content69% 
IMG OID643589245 
ProductCys/Met metabolism pyridoxal-phosphate-dependent protein 
Protein accessionYP_002486928 
Protein GI220911619 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.187169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTTT CCGAACAGCA GGCCGCCGGC CTGTCCGCCG AAACCATGGT AGTGGCCGCG 
GGCCGCCCGC CGCGGGAACG GGACCAGCCG GTGAACCCGC CCCTGGTCCT GTCCTCCACG
TACTACGGCA CAGGCCCGCT CGGCCCCGGA GACCGTGGTT ACGGCCGGTA CTCCAACCCC
ACGTGGGACC CGTTCGAAGA GGCCCTCGGC CAGCTGGAAG GAGCGGACCT GCAGGGCCTG
CTCTACGCCT CCGGGCTTGC TGCCGTCAGT TCTGCGTTGT CGCTGATCCC CTCCGGCGGA
GTGCTGGTGA TGCCGAACCA CAGCTACTCC GGAACCCTTG TCATGGCCGC GGAGCTGGCC
CAGAAAGGGT TCATTGAGCT GCGAACGGTG GACATCGCCG ATACCGGGGC CGTCAAAGCG
GCCATCGCCC CGGGCGGGCC GGACGCCAGG GCCGCCGCCA TGCTGTGGCT GGAAAGCCCT
ACCAACCCCA TGCTGGGAAT CGCCGACATT CCTGCGCTGA CGGAAGCCGC GCACGCGTCG
GGCGCCATCG TGGTCACGGA CAATACCTTC TCGACGCCGC TGGTGCAGCA GCCCCTGGCC
CTGGGCTCCG ACGTCGTACT CCACTCGGTG ACCAAGTACC TGGCGGGCCA CTCGGACGTC
GTCCTGGGTG CCCTGGTGAC CTCCAACGCG GACATCCGCT CCGCCCTCCT GCACCACCGG
ACCATCCACG GAGCGATCGC CGGGCCCTTT GAGGCCTGGC TTGCGCTGCG CGGCCTGCGC
ACCCTGGCGC TGCGCGTTGA AAAGTCCCAG GAGTCTGCCA AGGTCCTGGC GGAACGGCTG
GGCACCCACC CGGGGGTCGA ATCGATCCGG TTCCCGGGCC TCCGCACCGA TCCCGGGCAC
GAAAGGGCCG CGGCGCAGAT GAAGGGCTTT GGCTCCATCA TCTGCATCCA GGTGGCGCCC
GCCGGCGGAC TGGACGGCGC AGCAGCGGCC GACAAGGTCG TTGAAGCCGT CAACCTCTGG
CTGCCGGCCA CATCGCTGGG CGGCGTGGAA TCGCTGATCG AACGCAGGCG GCGGCACACC
GCGGAGCCGG CCACGGTGCC GGAGAACCTG GTCCGCCTCA GCACCGGCAT TGAGAACGTG
GAAGACCTCT GGGCAGACCT GGAGCAGGCG CTGGACACTC TGGGCGGCTA G
 
Protein sequence
MSLSEQQAAG LSAETMVVAA GRPPRERDQP VNPPLVLSST YYGTGPLGPG DRGYGRYSNP 
TWDPFEEALG QLEGADLQGL LYASGLAAVS SALSLIPSGG VLVMPNHSYS GTLVMAAELA
QKGFIELRTV DIADTGAVKA AIAPGGPDAR AAAMLWLESP TNPMLGIADI PALTEAAHAS
GAIVVTDNTF STPLVQQPLA LGSDVVLHSV TKYLAGHSDV VLGALVTSNA DIRSALLHHR
TIHGAIAGPF EAWLALRGLR TLALRVEKSQ ESAKVLAERL GTHPGVESIR FPGLRTDPGH
ERAAAQMKGF GSIICIQVAP AGGLDGAAAA DKVVEAVNLW LPATSLGGVE SLIERRRRHT
AEPATVPENL VRLSTGIENV EDLWADLEQA LDTLGG