Gene Noca_3820 details

Gene Information

Locus tag: Noca_3820
Symbol:
ID: 4595885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4040538
End bp: 4041734
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 73%
IMG OID: 639778428
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_925007
Protein GI: 119718042
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
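
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4040538, 4041734)
print(length_bp)                  # 1197
print(protein_length(length_bp))  # 398
```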


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.965003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGGTCG TCGACGGCTG CCGGCCCGAC GAGATCGACA GCGGCCTGAC GCCGAACCTG 
GCGGCGCTGC GCGACGGTGG GCTCCGGTTC CCGCAGGCGT CGTCGATGCC GGTCATGGAG
ACGATCCCCA ACCACGTGAT GATGATGACC GGGCTGCGCC CCGACCGCAC CGGGGTGCCC
GCGAACTCCG TCTTCGACCG CGGGCTCGGC GAGGTGCGCA CCCTCGACCG GCCCTCGGAC
ATCCGGTGCG GAACGCTGCT GGGCCGGCTC GGCCGGCGCG GCCTCACCAC CGGCACGGTG
CTCTCCAAGA CCTACCTGTA CGGCGTCTTC GGCGGCCGTC CCACACACCG CTGGGAGCCC
AGCCCGACGC TGCCGATCAC CGACCACGCC CCCGACGCGC TCACCATCGA CGCCGCGATC
ACGATGCTCG AGGAGTACGA CCCGAACCTG ATGTTCGTCA ACCTCGGCGA CATCGACCGG
TTCGGGCACG CCGACCTCAC CGGCACCACG CTGCGCGTCG CCCGCCGGCT CGCACTGGCC
GACACCGACC TGCAGGTCCA GCGGTTCCTC GACGCGCTGA AGGCCCAGGG GCTCTGGGAC
CGGTCGATCG TGATCGTGCT GGCGGACCAC TCGATGGACT GGTCGACCCC GGACCGGTTG
ATCGGCCTGA CCGGGCCGCT CACCGCAGAC CCGCTGCTCG CCGGGCGGGT CCAGATCGCC
GACAACGGCG GCGCCGACCT CCTGTACTGG ACCGGCCCCG ATACCCAGCG CGCCGAGGCG
ATCGAACGGA TGCGGACCAT CGCGCGGGCC CAGGAGGGGG TGCTCGCGGC GTACGCACGC
ACCGCCCCCT GGCTGCGCCT GGGACCGGAG GCCGGTGACG TCGTAGTGTT CTGCCAGGCC
GGCTGGCGGT TCAGCGAGCC GGACCCCACC GCGAACCCGA TCCCCGGCAA CCACGGCCAC
CCGGCCACCC GGTCGATCCC GTTCTTCGTC GGCGGCGGCC ACCCCGACGT ACCCCGACGC
ACCGCGTCCT CGCGGGTCGC CCGCACCATC GACGTCGCCC CCACCGTCGC CGCGTTCTTC
GGCGCCGGCG CGCCGAAGGG CGGGTACGAC GGCCGCAACC TGCTCCCCCG CACCCCACGA
CAGCAACCGG TGATCGAGAT CGTGGAGGTC CCCGCACCCC ACGCCGGCCA CCGGTGA
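
The GC content field in the record can in principle be recomputed from a gene sequence laid out like the one above. The sketch below is one hedged way to do that in Python; `gc_content` is a hypothetical helper, and it assumes the sequence contains only A, C, G, T plus the spaces and line breaks used for layout.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, including its layout spaces.
first_line = "GTGCTGGTCG TCGACGGCTG CCGGCCCGAC GAGATCGACA GCGGCCTGAC GCCGAACCTG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives a value that can be compared with the GC content field of the record.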
 
Protein sequence
MLVVDGCRPD EIDSGLTPNL AALRDGGLRF PQASSMPVME TIPNHVMMMT GLRPDRTGVP 
ANSVFDRGLG EVRTLDRPSD IRCGTLLGRL GRRGLTTGTV LSKTYLYGVF GGRPTHRWEP
SPTLPITDHA PDALTIDAAI TMLEEYDPNL MFVNLGDIDR FGHADLTGTT LRVARRLALA
DTDLQVQRFL DALKAQGLWD RSIVIVLADH SMDWSTPDRL IGLTGPLTAD PLLAGRVQIA
DNGGADLLYW TGPDTQRAEA IERMRTIARA QEGVLAAYAR TAPWLRLGPE AGDVVVFCQA
GWRFSEPDPT ANPIPGNHGH PATRSIPFFV GGGHPDVPRR TASSRVARTI DVAPTVAAFF
GAGAPKGGYD GRNLLPRTPR QQPVIEIVEV PAPHAGHR
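
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch in Python, assuming that table 11 shares the standard codon assignments, that the listed alternative start codons are read as methionine in the first position, and that translation stops at the first stop codon. The names `translate`, `CODON_TABLE`, and `START_CODONS` are illustrative and do not come from the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for the bacterial code (table 11).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    dna = "".join(cds.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for position in range(0, len(dna), 3):
        codon = dna[position:position + 3]
        if position == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGCTGGTCG TCGACGGCTG CCGGCCCGAC"))  # MLVVDGCRPD
```

The GTG start codon is why the protein begins with M rather than V: as an internal codon GTG encodes valine, but in the initiating position it is read as methionine.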