Gene Arth_3433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3433 
Symbol 
ID4444163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3866523 
End bp3868121 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content66% 
IMG OID639691257 
Productmalate synthase 
Protein accessionYP_832908 
Protein GI116671975 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.680692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCTCA CAGTCACAGA CCCGCAGCCG ATTGCACGAG CAGAGGAAAT CCTCACCCCG 
AAGGCACTGG CCTTCGTGGA GGAGCTCCAC AAGCGGTTTG CGGGCACCCG CGCCGAACTC
CTCAAAGCCC GCGTGGCCAA GCGCGAGCAG GTAGCCCGGA CCGGAAAGCT TGATTTCCTG
TCCGAAACCA AGGATGTCCG CGAGGGCGAC TGGAAGGTTG CGCCGGCGCC CGCCGCGCTG
CAGGACCGCC GCGTGGAAAT GACCGGACCC GCCTCGCCGG CCAAGATGGC CATCAATGCC
CTCAACTCCG GCGCCAAGGT GTGGCTGGCC GACCTCGAGG ACGCCAGCAC GCCCACCTGG
GCCAACGTCA TTGACGCCAT CCTGAACCTC CGCGATGCCG CCACGGGAAC CCTGAGCTAC
ACCTCCCCGG AGGGCAAGGA ATACCGGCTC CGCTCCGACG CCCCGCTCGC CGTCGTGGTG
GCCCGGCCCC GCGGCTGGCA CATGGACGAG CACCACCTGC TGCTCGACGG CGAACACACC
GTGGGCGCGC TGGTGGACTT CGGCCTGCAC TTCTTCCACA CGGCCAAGCA GCTCCTGCTC
AACGGCCAGG GCCCGTACTA CTACCTGCCG AAGATGGAGA GCCACCTTGA GGCGCGCCTC
TGGAACGACG TGTTCGTCTT CGCGCAGGAT TTCCTGGGCA TTCCGCAGGG CACCATCAAG
GCCACCGTGC TGATCGAGAC GATCCCCGCG GCCTTCGAGA TGGACGAGAT CCTGTACGAG
CTCCGCGACC ACGCCGCCGG GCTCAACGCC GGCCGCTGGG ACTACCTGTT CAGCATCATC
AAGTACTTCC GCGACGCCGG AGCGGACTTT GTACTGCCGG ACCGCGCCAC CGTGGCCATG
ACGGCACCGT TCATGCGGGC CTACACCGAG CTGCTCGTCA AGACCTGCCA CCACCGCGGC
GCGTTTGCCA TGGGAGGCAT GGCCGCAGTC ATCCCCAACC GTCGCGAACC CGAGGTCACC
GCCCAGGCAT TCGAGAAGGT CCGCGCCGAC AAGACGCGCG AAGCCAACGA CGGCTTTGAC
GGCTCTTGGG TGGCCCACCC TGACCTGGTG CCGGTGTGCC GGGAAGTGTT CGATTCCGTC
CTGGGCGAGC GCCCCAACCA GCTGGACAAG CAGCGCCCGG AGGTCCATGT CACGGCGGAC
CAACTGCTGG ACATTGCCTC GGCCGACGGC ACGGTGACGG AGGCCGGGCT GCGGCTGAAC
CTCTACGTTG CGGTCGCATA CACGGCTGTT TGGATCTCAG GCAACGGTGC GGTGGCCATC
CACAACCTGA TGGAAGACGC CGCCACGGCC GAGATCTCCC GTTCGCAGGT CTGGCAGCAG
ATCCGCAACA AGGTGGTCCT GGCCGATACC GGCAACACTG TCACCCGTGA ACTGGTCAGC
AGCATCCTGG CCCAGGAAAC CGACAAGCTC CGCGGTGAAG TGGGCGAGGA GACCTTCGCC
AAGTACTACC AGCCGGCCAG CGAACTGATC GCTGATATCT GCCTTTCCGA GGACTACACC
GACTTCCTCA CCACGCCGGC CTACGAACTG GTGGGCTGA
 
Protein sequence
MALTVTDPQP IARAEEILTP KALAFVEELH KRFAGTRAEL LKARVAKREQ VARTGKLDFL 
SETKDVREGD WKVAPAPAAL QDRRVEMTGP ASPAKMAINA LNSGAKVWLA DLEDASTPTW
ANVIDAILNL RDAATGTLSY TSPEGKEYRL RSDAPLAVVV ARPRGWHMDE HHLLLDGEHT
VGALVDFGLH FFHTAKQLLL NGQGPYYYLP KMESHLEARL WNDVFVFAQD FLGIPQGTIK
ATVLIETIPA AFEMDEILYE LRDHAAGLNA GRWDYLFSII KYFRDAGADF VLPDRATVAM
TAPFMRAYTE LLVKTCHHRG AFAMGGMAAV IPNRREPEVT AQAFEKVRAD KTREANDGFD
GSWVAHPDLV PVCREVFDSV LGERPNQLDK QRPEVHVTAD QLLDIASADG TVTEAGLRLN
LYVAVAYTAV WISGNGAVAI HNLMEDAATA EISRSQVWQQ IRNKVVLADT GNTVTRELVS
SILAQETDKL RGEVGEETFA KYYQPASELI ADICLSEDYT DFLTTPAYEL VG