Gene Noca_4033 details


Gene Information

Locus tag: Noca_4033
Symbol:
ID: 4596547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4256690
End bp: 4257712
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 72%
IMG OID: 639778639
Product: ethanolamine ammonia-lyase light chain
Protein accession: YP_925217
Protein GI: 119718252
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4302] Ethanolamine ammonia-lyase, small subunit
TIGRFAM ID:
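
The length fields above follow from the coordinates. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon (the function names are illustrative and not part of the source database):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4256690, 4257712)
print(length, protein_length_aa(length))  # prints: 1023 340
```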


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.157307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACCG ACGAGCTCCG CAGCATCGTC GCGGAGGTCC TCGCCGAGCT TGCGGAGCCC 
GGTGACGCCT TCGCCCGCCT CACCACGCCC GCCACGACCG CCGGGCCCTC CGGGCCCACG
TCGACGCCGG CACCGGAGGA GTCCGACGCG CCATCGTCGG CGGCCACCGA GCCCGCGGCG
GTGCCCGCGT CGAGCGCGAC GGAGATCACC CGCCCCACCC TGTCCGGTGC GCCAGTGAGC
ATCGAGGTCT CCGACCCCAC GGTGCCGGAG GCGCGCCACC GGATCGGGGT CGAGAACCCC
GCGAACCCGA GCGGGCTCGC GAACCTCGCC GCCTCGACCG CCGCCCGGAT CGCCGTCGGC
CGGGCGGGCC CGCGGCCGCG CACCGAGAGC GTGCTGCTGT TCGGCGCCGA CCACGCGGTG
ACCCAGGACG CGATCTTCGG CGACGTGCCC ACCGCGCTCC TGGACCAGTT CGGGCTGTTC
GCTGTGCAGA CCAAGGTCAC GACGCAGGAC GAGTTCCTCC TGCGTCCCGA CCTCGGCCGG
GAGTTGGACG ACGCCGCCAA GCTCGTGGTC GCCGAGAAGT GTGTCAAGGG CCCGCAGGTG
CAGATCGTCG TCGGCGACGG CCTCTCGGCC GCCGCGGTGA CCAACAACCT GCCGCAGATC
TACCCGGTGC TGGAGGCGGG CCTGCGCGAC GCCGGCCTGA CCCTGGGCAC GCCGTTCTTC
GTGCGGTACT GCCGGGTGGG GGTGATCAAC GACATCAACG ACATCGTCGG GGCGGACGTG
GTCGTGCTCC TCATCGGGGA GCGTCCCGGG CTCGGGGTCG CGGATGCGCT GAGCGTCTAC
TCCGGATGGC GTCCCACCGC CGGCAAGACC GACGCCCACC GCGACGTGAT CTGCATGATC
ACGCAGAACG GCGGCACGAA CCCGCTGGAG GCCGGCGCCT TCGCGGTCGA GCACGTCAAG
AACGTCATGA AGCACCAGGC CAGTGGCGTC GAACTGCGAC TCCAAGAGAG CGGGACCCGC
TGA
 
Protein sequence
MSTDELRSIV AEVLAELAEP GDAFARLTTP ATTAGPSGPT STPAPEESDA PSSAATEPAA 
VPASSATEIT RPTLSGAPVS IEVSDPTVPE ARHRIGVENP ANPSGLANLA ASTAARIAVG
RAGPRPRTES VLLFGADHAV TQDAIFGDVP TALLDQFGLF AVQTKVTTQD EFLLRPDLGR
ELDDAAKLVV AEKCVKGPQV QIVVGDGLSA AAVTNNLPQI YPVLEAGLRD AGLTLGTPFF
VRYCRVGVIN DINDIVGADV VVLLIGERPG LGVADALSVY SGWRPTAGKT DAHRDVICMI
TQNGGTNPLE AGAFAVEHVK NVMKHQASGV ELRLQESGTR
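
The sequences above are printed in space-separated blocks of 10. The following is a rough Python sketch, not the database's own tooling, of how such blocks could be joined, checked for GC content, and translated. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in their permitted start codons); all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Join 10-character blocks and lines into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = clean_sequence(
    "ATGTCGACCG ACGAGCTCCG CAGCATCGTC GCGGAGGTCC TCGCCGAGCT TGCGGAGCCC"
)
print(translate(first_line))  # prints: MSTDELRSIVAEVLAELAEP
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared with the GC content and protein sequence listed in this record.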