Gene Noca_4004 details

Gene Information

Locus tag: Noca_4004
Symbol: gabD2
ID: 4598139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4225593
End bp: 4227152
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 70%
IMG OID: 639778609
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_925188
Protein GI: 119718223
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
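
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4225593
end_bp = 4227152

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no amino acid.
STOP_CODON_LEN = 3
protein_length = (gene_length - STOP_CODON_LEN) // 3

print(gene_length)     # 1560
print(protein_length)  # 519
```

Under these assumptions the computed values agree with the listed Gene Length (1560 bp) and Protein Length (519 aa).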


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACTCC AGCGACCGGC GTCGATCACC GACGCGTTCC TCGAGCGGCT CGTGGCCCGC 
GTGCCGTCGA CCAGCGGCGG CACCTGGAAG CTCACCGAGG TCTACACCGG TGACCTGCTC
GTGGAGCTCC CGCAGTCGAC ACCCGCCGAC ATCGAGGCGG CGTTCGCGGC GGCGCGCGCG
GCCCAGCGCT CGTGGGCGGC CCGACCGCTC AAGGAGCGGC TCGAGGTGTT CAAGCGCGCG
CACGCCCTGT TCCTCGACAA CGCCCACACC ACGACCGACC TGATCCAGGT CGAGAGCGGC
AAGAACCGGC GGATGGCCAT CGAGGAGACC TGCGACCCGG TGATGGTGAT GAGCCACTAC
CTCAAGCGGG CCCCGCACCT CCTCAAGCCG GTCAAGCGCG GCGGGCCGAT CCCGTTCTTG
TCCAGCTCGA CCGAGATCCG CCAGCCCAAG GGCGTCGTCG GGATCATCGC GCCGTGGAAC
TTCCCGTTCG CGACCGGCAT CTCCGACTCG ATCCCGGCGC TGATGGCCGG CAACGCGATC
GTGCTCAAGC CGGACAACAA GACGGCGCTC TCGCCGCTGT ACGGCGTGCA GATGCTCGAG
GAGGCCGGCC TGCCGAAGGG GCTCTTCCAG GTGGTCTGCG GCGAGGGCCC GGACGTCGGC
CCGACGCTGA TCGACAACGC CAACTACGTG ATGTTCACCG GCTCGACCGC GACCGGTCGG
GTGATCGGGG AGCGGGCCGG GCGCAACCTA ATCGGCTGCT GCCTCGAGCT CGGCGGCAAG
AACCCGATGA TCGTGCTCGA GGACGCGGAC CTCGACGAGG TCGTGCAGGG CGCGATCTTC
GGCGCGTTCG GCAACACCGG CCAGATCTGC ATGCACATCG AGCGGATGTA CCTGCCCGCG
TCGAGGTACG ACGAGTTCCG CTCGCGGTTC GTCGCTGCGA CCGAGGCGCT GACCATCGGC
GCGGCGTACG ACTTCGGCCC CGACATGGGC TCGCTGGTCT CGCCGGACCA CATGGAGCGG
GTCCGGGGGC ACGTCGACGA CGCCGTGGCC AAGGGCGCCA CCGTGCTCAC CGGCGGCCGG
TCACGGCCCG ACCTCGGCCC GGCCTTCTTC GAGCCGACCA TCCTGGAGGG CGTCACCCAG
GACATGCTCT GCGGCGTCAC CGAGACCTTC GGCCCGGTCG TCGCGCTGCA CCGGTACGCG
ACCGTCGACG AGGCGATCGC ACTCGCGAAC GACACCGACT ACGGGCTGAA CGCCTCGGTG
TGGGGCGGCG ACATCGCCAG CGCCTGCCAG GTCGGCCAGC GGATCGAGAC GGGCAACGTG
AACGTCAACG ACATCCTCGC GACGGCGTAC GCGTCCAAGG GCACGCCCTC GGGCGGCGTC
AAGCAGTCCG GCGTGGGCGC CCGGCACGGC GACCAGGGCC TGCTGAAGTA CACCGACGTG
CAGAACCTCG CCGTCTTGAA GAAGCAGGTG ATGGGCGCGC GGCCCGGCCA GGACTACGAG
AAGTACGTCA AGGGGATGCT CAGCGGCCTG CGGATGATGC GCAAGACCGG CATCCGCTAG
 
Protein sequence
MALQRPASIT DAFLERLVAR VPSTSGGTWK LTEVYTGDLL VELPQSTPAD IEAAFAAARA 
AQRSWAARPL KERLEVFKRA HALFLDNAHT TTDLIQVESG KNRRMAIEET CDPVMVMSHY
LKRAPHLLKP VKRGGPIPFL SSSTEIRQPK GVVGIIAPWN FPFATGISDS IPALMAGNAI
VLKPDNKTAL SPLYGVQMLE EAGLPKGLFQ VVCGEGPDVG PTLIDNANYV MFTGSTATGR
VIGERAGRNL IGCCLELGGK NPMIVLEDAD LDEVVQGAIF GAFGNTGQIC MHIERMYLPA
SRYDEFRSRF VAATEALTIG AAYDFGPDMG SLVSPDHMER VRGHVDDAVA KGATVLTGGR
SRPDLGPAFF EPTILEGVTQ DMLCGVTETF GPVVALHRYA TVDEAIALAN DTDYGLNASV
WGGDIASACQ VGQRIETGNV NVNDILATAY ASKGTPSGGV KQSGVGARHG DQGLLKYTDV
QNLAVLKKQV MGARPGQDYE KYVKGMLSGL RMMRKTGIR
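
The sequences above are printed in blocks of ten letters separated by spaces. A minimal sketch of how such a listing might be cleaned up and how a GC fraction could be computed from the gene sequence follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not state how its own GC content figure was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied verbatim.
first_line = "ATGGCACTCC AGCGACCGGC GTCGATCACC GACGCGTTCC TCGAGCGGCT CGTGGCCCGC"

print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence listing to `gc_fraction` would give a value to compare against the GC content field.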