Gene Noca_4100 details


Gene Information

Locus tag: Noca_4100
Symbol:
ID: 4596614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4331326
End bp: 4332777
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 71%
IMG OID: 639778706
Product: amidohydrolase
Protein accession: YP_925284
Protein GI: 119718319
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
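
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4331326
end_bp = 4332777
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span == gene_length_bp)              # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 1452 bp is 484 codons, of which 483 encode amino acids.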


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGGACC AGCCGGCCGG CCCGGAGGTC GAGACCGCTG ACCTGCTCCT ACGCGGTGCC 
ACGGTCGTCA CGATGGACGC GGACCGCACG GTCTACGAGA GGGGCTACGT CGCCGTGCGC
GGCCAGGAGA TCCTCTCGGT CGGGGCCGAC GACGGCGACG TGCCCCGCGC TCGGGAGGTC
CGCGACCTCG ATGGACACGT GGTGTTGCCG GGGCTGGTCA ACTGCCACAC GCACCTGTCG
AACGGCATCT CCCGAGGCCT CTTCGACGAG TTGCCCCTCG CCGACTGGGT GGAGAAGGGG
ATGTGGCCCT CGTTGCGCGC CAACACCCGG GAGGCGACGT ACCACGGGGC GCGGGTCGCC
CTGGCCGAGA ACCTGCTCGG CGGCGTGACG ACCACGGTCG TGGGTGAGTT CGGCGTCCCC
GCCCGCGACA CGCTCGACGG GGTGTTGGCG GCCGTCACCG AGTCCGGTTC GCGCTCGGTC
GTGGCCCGCA TCTCGGTGGA CTCCGCCGAC GACCACGACT CCAGTCAGGC CGTCCCCCCT
GACGTTCGCG AGGACATCGA CGCGGCGTTG GCCGAGGTGG ACCGGCTGCG ATCCGGCTAC
GGCTCGGACC TCCTCGAGGT GGTCCCTGAA GCCCTCGGCG TGCTTCGCTG CTCGGCGGAC
ATGGTCACGG AGTTCGCTCG CTACGCCCGG GACCGGGGCA CCCGGATGAC GATGCACGTC
GCGAGCTCTC CCGACGAGCG CGACGAGGCG CAGTACCGCT TCGGCAAAGG GTCCGTCGAG
CGCCTGCACG ACCTGGGTGT CCTCGGGCCG CACCTGTTGG TCGCCCACTG CGTGTGGAAC
GACGACCGCG AGCGTGCACT GCTCGCCGAG AGCAGGACCG GGGTCTCCCA CAATCCCGTG
GCGAACCTGA TGTACGCCTC GGGTCTGGCA CCCCTCTCGG AGATGCTCGA AGCAGGCGTG
CGAGTGGGAC TCGGCACCGA CGGGGCGTCC ACCAACAACG GCCAGAACAT GTGGGAGGTC
ATGAAGACCG CCATGTTCCT GCAGAAGTCG CGCTTCGGCG CCGGGTGGGG ATCGGCCGAG
CTCGCCCTGG AGCTGGCCAC TCTCGGTGGA GCGCGGGCCA TCGGCATGGA GGACCGCATC
GGCTCGCTCG GAGCCGGCAA GCGTGCCGAC ATCGTGGTGG CCACGTTGAA CAAGCCGGAG
CTGGTCCCGC ACGCCACCTG GCCGTCGAAC CTGGTCTACT CCTTCAGCCC GAGCGCGGTG
CGGACCGTGC TGGTCGACGG GCGCGTGGTG GTCGACGACG GTCGAGTCGT CGCGTGGGAG
CACGACGACG TCATCGCGCA CGGCAACCGG ATGGCCCTCG AGATGGACGC CCACACCGGT
CTGGCCCGGG CCTACCGGCA GCGGAGCCGC TGGCGCTGGG TGGGCGAGCG AGGCGGTCAG
CCGGCCTCCT GA
 
Protein sequence
MTDQPAGPEV ETADLLLRGA TVVTMDADRT VYERGYVAVR GQEILSVGAD DGDVPRAREV 
RDLDGHVVLP GLVNCHTHLS NGISRGLFDE LPLADWVEKG MWPSLRANTR EATYHGARVA
LAENLLGGVT TTVVGEFGVP ARDTLDGVLA AVTESGSRSV VARISVDSAD DHDSSQAVPP
DVREDIDAAL AEVDRLRSGY GSDLLEVVPE ALGVLRCSAD MVTEFARYAR DRGTRMTMHV
ASSPDERDEA QYRFGKGSVE RLHDLGVLGP HLLVAHCVWN DDRERALLAE SRTGVSHNPV
ANLMYASGLA PLSEMLEAGV RVGLGTDGAS TNNGQNMWEV MKTAMFLQKS RFGAGWGSAE
LALELATLGG ARAIGMEDRI GSLGAGKRAD IVVATLNKPE LVPHATWPSN LVYSFSPSAV
RTVLVDGRVV VDDGRVVAWE HDDVIAHGNR MALEMDAHTG LARAYRQRSR WRWVGERGGQ
PAS
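
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch only, not the method used to produce the record. It assumes that translation table 11 uses the standard amino-acid assignments and that the first codon (here GTG) is read as methionine at initiation. The helpers `clean`, `gc_fraction`, and `translate` are hypothetical names introduced for this example, and only the first 60 bases of the gene sequence are used.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order. Assumption: translation
# table 11 shares these assignments and differs mainly in allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon and is reported as M.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence shown in this record.
first_line = "GTGACGGACC AGCCGGCCGG CCCGGAGGTC GAGACCGCTG ACCTGCTCCT ACGCGGTGCC"
seq = clean(first_line)

print(translate(seq))    # MTDQPAGPEVETADLLLRGA
print(gc_fraction(seq))  # 0.75
```

The translated fragment matches the first 20 residues of the protein sequence. Applying the same helpers to the full gene sequence would give the whole-gene values.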