Gene Arth_1786 details


Gene Information

Locus tag: Arth_1786
Symbol:
ID: 4445685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2000398
End bp: 2001879
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 66%
IMG OID: 639689604
Product: succinate semialdehyde dehydrogenase
Protein accession: YP_831276
Protein GI: 116670343
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
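
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 2000398
end_bp = 2001879
gene_length_bp = 1482
protein_length_aa = 493

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(span, 3)
expected_aa = codons - 1

print(span == gene_length_bp)            # coordinates agree with Gene Length
print(remainder == 0)                    # length is a whole number of codons
print(expected_aa == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under those assumptions, all three checks agree with the values listed in the record.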


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCATAA CCCATAACCG AGAGGACGCG GTGCTGGATT CCGTCCCGAC CGGCCTGCTG 
ATCAACGGGC AGTGGCGCCC GGCCGCGTCC GGTAAGACGT TCGAGGTCGA AGACCCCGCC
ACCGGCAAGA TCCTCATGGC CATCGCTGAC GCAGGAGTCG AGGACAGCCT CGCGGCCCTG
GACGCGGCCG CTGCCGCGCA GGACTCCTGG GCACAGGTCC CGGCCCGTGA ACGCGGCGAA
ATCCTGCGCC GCGCCTTCGA AATGGTCACC GCCCGCGCCG AAGACTTCGC CCTGCTCATG
ACCGTGGAAA TGGGCAAACC ACTAGCCGAG GCCCGCGGCG AGGTCACCTA CGCCGCCGAA
TTCCTGCGTT GGTTCTCCGA AGAAGCCGTC CGCACCTCCG GCCGCTACTC GGTCTCCCCG
GACGGCAAAT CCCGGCTCCT GGTCACTAAG AAGCCCGTGG GTCCCTGCCT GCTGATCACC
CCGTGGAACT TCCCGCTGGC CATGGCCACC CGCAAGATCG CCCCCGCCGT GGCCGCCGGC
TGCACCATGA TCCTCAAACC CGCCAAACTC ACACCACTGA CCTCCCAGCT CTTCGCCGCG
CTCATGCAGG AAGCCGGACT GCCCGCCGGT GTCCTGAATG TCATCCCCTC CACCACCGCG
GGCGCCACCA CCGGTCCGCT GATCAAGGAC ACCCGGCTCC GCAAGCTCTC CTTCACTGGA
TCCACCGAGA TCGGCCGCCG CCTGCTCGCC GACGCCTCCG AAACCGTGCT GCGCACCTCG
ATGGAACTCG GCGGCAACGC CCCGTTCATC GTCTTCGAAG ACGCCGACAT CGACGCCGCC
GTCACCGGGG CGATGCTGGC CAAGCTGCGG AACATGGGCG AGGCCTGCAC GGCCGCGAAC
CGTTTCATCG TCCACAAGTC CGTGGCAACC CAATTCATCG AAAAATTCGC CGTGAAAATG
GCCGACATGA CGACGGCCCG CGGCACGGAG CCGGAGTCCA AGGTCGGTCC CCTGATCGAT
GCGAAGAGCA GGGATAAAGT CCACGAACTC GTCACGAACG CCATCGACTC CGGCGCCGCC
GCCGTGATCG GCGGCCGACC CGTGGACGGC CCAGGCTACT TCTACGAGCC CACCATCCTC
ACTGGCGTCA CCGAAGGCAC CCGCATCCTG TCCGAGGAAA TCTTCGGACC CGTCGCCCCC
ATCATCACCT TCACCACCGA GGACGGGGCC GTCCGGCTCG CGAACAACAC CGAGTACGGC
CTGGCCGCCT ACGTCTACAC CCGCGACCTG AACCGCGGCA TCCGGATGGG CGAACGCCTC
GAAACCGGCA TGCTCGGCCT CAACACCGGC GTCATCTCCA ACGCAGCCGC ACCGTTCGGC
GGCGTCAAGC AGTCCGGTCT GGGCCGCGAA GGCGGCCACG AAGGCATCGA AGAATACCTC
GACACCCAAT ACATAGGCGT AGCTGACCCT TACGCCAGCT AG
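
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC percentage could be computed from such text, the hypothetical helper `gc_percent` below (not part of the original record) strips the whitespace and counts G and C bases. The sample input is only the first line of the sequence, so its result is not expected to match the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the G+C percentage of a nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a small input sample.
sample = "ATGTCCATAA CCCATAACCG AGAGGACGCG GTGCTGGATT CCGTCCCGAC CGGCCTGCTG"
print(round(gc_percent(sample), 1))
```

Passing the full sequence text, all lines joined into one string, gives the value for the whole gene.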
 
Protein sequence
MSITHNREDA VLDSVPTGLL INGQWRPAAS GKTFEVEDPA TGKILMAIAD AGVEDSLAAL 
DAAAAAQDSW AQVPARERGE ILRRAFEMVT ARAEDFALLM TVEMGKPLAE ARGEVTYAAE
FLRWFSEEAV RTSGRYSVSP DGKSRLLVTK KPVGPCLLIT PWNFPLAMAT RKIAPAVAAG
CTMILKPAKL TPLTSQLFAA LMQEAGLPAG VLNVIPSTTA GATTGPLIKD TRLRKLSFTG
STEIGRRLLA DASETVLRTS MELGGNAPFI VFEDADIDAA VTGAMLAKLR NMGEACTAAN
RFIVHKSVAT QFIEKFAVKM ADMTTARGTE PESKVGPLID AKSRDKVHEL VTNAIDSGAA
AVIGGRPVDG PGYFYEPTIL TGVTEGTRIL SEEIFGPVAP IITFTTEDGA VRLANNTEYG
LAAYVYTRDL NRGIRMGERL ETGMLGLNTG VISNAAAPFG GVKQSGLGRE GGHEGIEEYL
DTQYIGVADP YAS