Gene Arth_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1944 
Symbol 
ID4445528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2193691 
End bp2195130 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content69% 
IMG OID639689754 
Productsuccinate semialdehyde dehydrogenase 
Protein accessionYP_831426 
Protein GI116670493 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCTCA AATCAGCCCA ACACCTCGTC AACGGCACCT GGCACGCTAC CGGGACCTCC 
AAGCACGTGA CGGACCCCGG AAACGGCAGC ACCGTAGGTG AGGTCGCCTG GGGCACCGCC
GGGGATGCCA CCCAGGCAGC CGACGCGGCC GCGGAGGCCC TTGGGTCCTG GTCACGCACC
ACGGTGCGCA ACCGCGCCGA CCTGCTCCGC AGCGCAGCCG ACCTCCTTGC CGAACGCCGC
GACGAACTCG CCCACACCCT GGCGCTCGAG GCGGGCAAGC GGCTCCCTGA AGCCCAGGGC
GAGGTGGACT TCTCGGTGGA ATACTTCCGC TGGTTCGCCG AGGAAGTCCG CCGCTCCACC
GGCACCGTCA GCCCGCCCGA ACTCCAGGGC CGGCGCCACC TCAGCCTCCG TAAACCTATC
GGCGTGGCAC TCAGCCTCAC CCCATGGAAC TTTCCCGTAT CCATCCAGGC CCGCAAACTC
GCCGCAATGC TGGCCGCAGG CTGCACCGTG GTGGGCCGGG TCTCCGAAAA GGCGCCGCTC
GCCGCCACCG GCCTGTTTGA GGTCCTGCAC GACGCCGGGT TCCCCGCCGG CGTCGTCAAC
CTGGTCCACG GGCCCTCGCG CGAAATTACC GCCGCCCTGA TGTCCCACCC GGCGGTCCGG
GCCGTCAGTT TCACCGGTTC CACAGGCGTG GGCCGGCAGA TCATGGCGTC TGCATCGGAA
CGCGTGGTCC GGCCCCTGCT CGAACTGGGC GGCAATGCAC CCTTCATCGT GTTCGAGGAT
GCCGACCTGG ATGCCGCCGT CGAAGGTGCC GTCCTGGGCC GTCTCCGCAA CACAGGCCAG
TCCTGCGTGG CCGCCAACCG GTTCCTGGTC CAGGACAGCA TCGCCGAGGA ATTTTCGCAG
AAACTGGCGG CGCGGTTCGA CGCCATGAGC ATCGGCCACG GCGTTCCCGA CGACGGTTCT
GACGTGCCGG ACCTCGGCCC CATGATCGAC GCCGATCGGG TGGCCGCCGT CCAGGCGCTG
GTGGACGACG CCCTCGCGCG CGGCGCACGC CGCGTCACGC AGCGGACCGA TGTTCCGGCG
CGCGGCGCGT TCATGGCTCC CACACTGCTC ACGGACGTCC CCGACGACGC ACCCCTGGTG
AGCGAAGAAG TGTTCGGCCC GGCGGCCGGC GTCGTGACCT TCACGTCGGA AGAGGACGCT
ATCCGCAAGG CGAACGCAAC CGAGATGGGC CTCGCCGCTT ACCTCTGGAG CCGCGATCCC
AAGCGCGCCT GGGACATCCC CGAACGCCTG GAAGCCGGCA TCGTGGGGGT CAACGATCCC
CTCCCCTCCG TAGCGTTCGC CCCCATGGGC GGCGCCAAGC AGTCCGGTCT GGGCCGCGAA
GGAGCAAGCC TTGGCCTCGA GGAGTTCGAG GAGGTCCAGT ACGTGGCCTG GAGGCCGTAA
 
Protein sequence
MNLKSAQHLV NGTWHATGTS KHVTDPGNGS TVGEVAWGTA GDATQAADAA AEALGSWSRT 
TVRNRADLLR SAADLLAERR DELAHTLALE AGKRLPEAQG EVDFSVEYFR WFAEEVRRST
GTVSPPELQG RRHLSLRKPI GVALSLTPWN FPVSIQARKL AAMLAAGCTV VGRVSEKAPL
AATGLFEVLH DAGFPAGVVN LVHGPSREIT AALMSHPAVR AVSFTGSTGV GRQIMASASE
RVVRPLLELG GNAPFIVFED ADLDAAVEGA VLGRLRNTGQ SCVAANRFLV QDSIAEEFSQ
KLAARFDAMS IGHGVPDDGS DVPDLGPMID ADRVAAVQAL VDDALARGAR RVTQRTDVPA
RGAFMAPTLL TDVPDDAPLV SEEVFGPAAG VVTFTSEEDA IRKANATEMG LAAYLWSRDP
KRAWDIPERL EAGIVGVNDP LPSVAFAPMG GAKQSGLGRE GASLGLEEFE EVQYVAWRP