Gene Arth_1755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1755 
Symbol 
ID4445724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1962049 
End bp1963542 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content67% 
IMG OID639689575 
Productsuccinate semialdehyde dehydrogenase 
Protein accessionYP_831247 
Protein GI116670314 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCC AGGCAACCAC CAACCCGCAG ACCTCGCAGA AGGCGCTCGA CGCCGTCGCC 
AAGGTCAGCA CCAACCTCTA CATTGACGGA GAATGGGCCG AAGCGGCCTC CGGCGCCCGG
TTCGACGTCA TCAACCCCGC CACCGAGGAA GTCATCGCTT CCGTCGCCGA CGGCGGCCCC
GAGGACGCCC GCCGCGCCAT CGAAACCGCC GGCCGCGTGC AGAAGCAGTG GGCCAAGACC
GCACCCCGGG AGCGCAGCGA GATCCTGCGC CGCGCCTTCG ATCTGATCAT GGCCCGCCAG
GACGAGCTGG CCCTGATCAT GACCACGGAA ATGGGCAAGC CCTTCGCCGA GGCCAAGGGC
GAGGTGGCCT ACGCCGCCGA GTTCTTCCGC TGGTTCTCGG AGGAAGCCGT CCGCATCGGC
GGTGACATGA CCACCACCGG CGACGGCAAG AACCGCATTC TGGTCACGAA GGAGCCGGTT
GGCCCGTGCG TCCTGGTGAC GCCCTGGAAC TTCCCGCTGG CCATGGGCAC CCGGAAGATC
GGCCCCGCCA TCGCAGCGGG CTGCACCATA GTCTTCAAGC CGGCCAACCT CACCCCGCTG
TCCTCGCTGG CGCTGGCGGA CATCCTGATC GAGGCCGGCC TCCCCAAAGG CGTACTGAAC
GTTGTCACCA CCACCAAAGC CTCAGAGGTG GTGACCCCGT GGATGGAAAG CGGCATTGCC
CGCAAGGTCA GCTTCACCGG TTCCACCGGC GTGGGCGTGC GCCTGCTGGA GCAGGCGGCC
AAGAACGTCA TGCGCTCCTC GATGGAACTG GGCGGAAACG CACCCCTCAT CGTGTTCGAG
GACGCAGACC TGGACCGCGC CGTGGAAGGT GCGTTCGCCG CCAAGATGCG GAACATGGGC
GAGGCCTGCA CAGCCGCCAA CCGCATCTTC GTCCAGCGCT CCGTTTCCGC CGACTTCTCC
GCCCGGCTCG CCAAGCGGCT CGGTGCCCTG AAAGTGGGCG ACGGCGCAGT GGACGGCACC
GACGTCGGCC CCCTCGTGGA GGAGAAAGCG CTGAACAAGG TCCAGGAACT CGTGGATGAC
GCCGTTTCCA AGGGCGCCAC CGTGATCTGC GGCGGGTCCC GCCCCGAGGG CAAAGGCTAC
TTCTACTCCC CCACCGTGCT GTCCGATGTC AGCTCCGACG CCGCACTGAT GAGCGAGGAA
ATCTTCGGCC CGGTGGCCCC AATCATCCCC TTCGACACCG AAGAGGAAGT GGTCCGGCTG
GCCAACGACA CCCCGTGGGG CCTGGCCAGC TACGTGTTCA CCCAGGACCT GGACCGCGCC
TTCCGCGTCG GCGACGAACT CGAGGTAGGC ATGGTTGGCC TGAACACGGG CATCGTCTCC
AACCCGGCAG CGCCGTTCGG CGGCATCAAG GCCTCCGGCC TGGGCCGTGA AGGCGGACGC
GTGGGCCTGG ACGAGTTCCT GGAGATCAAG TACATGGCAA TCCCGCGCGT CTAA
 
Protein sequence
MSIQATTNPQ TSQKALDAVA KVSTNLYIDG EWAEAASGAR FDVINPATEE VIASVADGGP 
EDARRAIETA GRVQKQWAKT APRERSEILR RAFDLIMARQ DELALIMTTE MGKPFAEAKG
EVAYAAEFFR WFSEEAVRIG GDMTTTGDGK NRILVTKEPV GPCVLVTPWN FPLAMGTRKI
GPAIAAGCTI VFKPANLTPL SSLALADILI EAGLPKGVLN VVTTTKASEV VTPWMESGIA
RKVSFTGSTG VGVRLLEQAA KNVMRSSMEL GGNAPLIVFE DADLDRAVEG AFAAKMRNMG
EACTAANRIF VQRSVSADFS ARLAKRLGAL KVGDGAVDGT DVGPLVEEKA LNKVQELVDD
AVSKGATVIC GGSRPEGKGY FYSPTVLSDV SSDAALMSEE IFGPVAPIIP FDTEEEVVRL
ANDTPWGLAS YVFTQDLDRA FRVGDELEVG MVGLNTGIVS NPAAPFGGIK ASGLGREGGR
VGLDEFLEIK YMAIPRV