Gene Nther_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0639 
Symbol 
ID6315190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp661714 
End bp662946 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content39% 
IMG OID642643022 
Productdiaminopropionate ammonia-lyase 
Protein accessionYP_001916822 
Protein GI188585277 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01747] diaminopropionate ammonia-lyase family
[TIGR03528] diaminopropionate ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.604891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTAA CTTGTAGGAA TCAGTACCGA TCCGAAAAAC TCCTTGTCAA TCAAAACCAA 
TTAGATTACA TAGAACAAGT AGTTAAATTT TATCAAAAAA TCACTAATTA TCAACCTACA
CCTCAAATCT CTTTACATGA CATGGCGAAT TCCATAGGCG TAAAAAATAT CTTTGTTAAA
GATGAATCTT CTAGATTAGG TTTGGATTCT TTTAAGGTCC TTGGCAGTTT GTATGCCGTT
GCAAATATAA TAGCCGAGTA CCTGGGAGAA GACCTGTCTC AACTTGATGA ACAGGAGCTA
CAAAGTCGTA AAGTAAAAGA ACGGGTGGGT CACTTAACTT TTGTGACAGC CACTGATGGG
AATCATGGTA AAGGATTGGC TTATGCCGCC AATTTTTTTG GGCATAATGC AGTTGTGTAC
TTACCTAAGG GTAGTGACAA TGATAGAGTC AAGGCAGTTG AACAAGCCGG AGGTAAAGCT
TATGTCACTG AAGCTAATTA TGATGATACC GTAATCTTTG CTTCCCAAAA AGCCCAGGAA
GAAGGTTGGA TTTTAGTACA AGATACTGCC TTTGATTCTT ACACTAAAAT ACCGGGCTGG
ATAATGGAAG GCTATTCAAT GATAGCCAAG GAAATAGTGG ATTATTTTAA TGCTCAAGAA
TCAAGTCAAT TCCCCACCCA TCTGATTATT CAGGCAGGAG TTGGTTCTCT GGCGGGAGGG
GTATTAGATT ACCTGGTCAA TAGATTAGGA GAACAAATTC CCAATATAAT TGTGGTTGAA
CCGGAAAATG CCGCCTGTAT GCTTAATTCG GCCTTGGAAG AAGGGGGTAC AGCTAAAAGA
ATATTTGGTG ACTTGGATAC TATTATGACA GGTTTGTCTT GTGGTGCTCC TAATCCCTTA
GGGTGGAAAG GAATAAAAGA TGCAACTAAT ACATTCATAT CAGTACCGGA CTGGGTAGCA
GCCAGAGGGC TTAGAATTTT ACATAATCCT CATGGAAGGG ATCCTATAGT GCAAGCTGGT
TTTTCTGGAA GCCCTGGTAT AGGATTGCTT TCATTATTTG AATTCGATCA TTTTACTGGA
TTAAAGGATT GGTTGGAAAT AGATGAAGAA TCAGTAGTTT TAACTATAAA TACTGAAAGT
GTAACAGACC ATGGTAATTA CAAAAGTGTT ATGTGGGATG GCCATCCTTG TACTCCGGTA
AATGGAGATT TTGACTGGAA AGCGCTTTTA TGA
 
Protein sequence
MILTCRNQYR SEKLLVNQNQ LDYIEQVVKF YQKITNYQPT PQISLHDMAN SIGVKNIFVK 
DESSRLGLDS FKVLGSLYAV ANIIAEYLGE DLSQLDEQEL QSRKVKERVG HLTFVTATDG
NHGKGLAYAA NFFGHNAVVY LPKGSDNDRV KAVEQAGGKA YVTEANYDDT VIFASQKAQE
EGWILVQDTA FDSYTKIPGW IMEGYSMIAK EIVDYFNAQE SSQFPTHLII QAGVGSLAGG
VLDYLVNRLG EQIPNIIVVE PENAACMLNS ALEEGGTAKR IFGDLDTIMT GLSCGAPNPL
GWKGIKDATN TFISVPDWVA ARGLRILHNP HGRDPIVQAG FSGSPGIGLL SLFEFDHFTG
LKDWLEIDEE SVVLTINTES VTDHGNYKSV MWDGHPCTPV NGDFDWKALL