Gene Noc_0194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0194 
Symbol 
ID3706228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp212129 
End bp213991 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content55% 
IMG OID637736712 
Productarylsulfatase A and related enzyme 
Protein accessionYP_342257 
Protein GI77163732 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA GCAATAATCT ATCGCGCCGG CAATTTCTGA AAACGACCGG TGCCGTGGCG 
ATGGCTAGCA GCGTAGCTGG TTTCAGTGAT GTGCTGGCGG TGCGAGACTG GTCGCATCCA
GGCCGCGGCC TCCGGGGGCG ACCCAATATC CTGTTGATGC TGGTCGACGA GATGCGTTAC
CCGCCAGTGT TCGAAGGGTT GGGTGCTCAA CAGTTTCGTC AAACCTATCT TAAGACCCAG
AATGCTTTGC GTGCGAGCGG CGTCGAGTTT CATCGTCACT ACGCCGCTGC CACCGCCTGC
GCGCCCAGTC GCGCTTCGAT CTTCACCGGA CACTATCCGT CGCTGCATGG TGTAACTCAG
ACTACTGGTG CCGCCAAGGA AGAAAATGAC CCAGACGTGT TCTGGCTCGA CCCTGCCAGT
GTTCCCACCA TGGGCGATTA TTTCCAGGCC GGTGGCTATC GCACGTTCTA CAAGGGCAAG
TGGCATGTGT CCAATGCTGA TTTGCAGATT CCAGGCACGC ATGATCAGCT TCTTAGCTAT
GACGACCAGG GCAATCCTGA CCCCGGCAAG CAGCAACTAT ACCTTGAAGC CGACCGGCTA
GCAGATTATG GTTTTGAGGG CTGGATCGGA CCTGAGCCTC ATGGTAAAGC GCCGCTTAAT
ACCGGTTCTA GCCCAGCCCA AGGCCAAGGC CGGGATGTTG GTTTTGCTAC CCAGGTGGTT
AATTTGATTC AGCAACTCGG CACCGAACGC CACAGCGCCC CGTGGTTGAC CGTAGCCTCC
TTGGTGAACC CCCACGACAT CGCCCTGTGG GGTTACGTCG CGCGCCATAC GGGGCTGTTC
AACTTCACAG TGGAGGATAT TGTTCCCGCG TTTACCGAGC TGTTCGACCC AGTGATGTTT
GCACAAACGC TCGCTGACGA CTTAACTACC AAACCATCCT GCCAGCAGAG TTATCAAGAG
TCATACAACG AATGGATGCA GGGAGTGCCA CCGCATGACT ATTTCCGGTT TTACTATCAG
CTTCACAAGA ATGTTGATGA CGAACTGTAC AAGCTCTATC AGGCCTTGCA GCAATCCCCC
TTTTATGACA ATACGATCGT GATCTTCACT TCCGATCATG GCGATCTGCT TGGCGCCCAT
CGCTATATGC ACCAAAAGTG GTACCAAGCC TACGATGAAG CGGTGCGCGT CCCATTGATC
ATCTCCAATC CACACTTGTT CCCAGAGCCG CGTTCCATTG ACAGCGTCAC CAGCCACGTG
GATCTGCTGC CGACCCTGCT GAGCCTTGCC CGGCTCAAGC AGGCCAGGCT GCGCCGCAAG
GTTGCCAAAG GCCACAGCGA TCCGGTGCCC CTAGTCGGGC GCAATCTCAG ACGGCTAGTA
CTTGGCCGTA ATCGCCGGCC AGTCGCTGAT CCCGTCTATT TCATGACCGA TGACGATATG
AGCCGCGGTC TCGATCAGGA AAACTTTATC GGTATCGCCT ACGGGTCGGT GATTCAACCG
AGTCACGTGG AGACCGTCAT CGTAGAGATC GACGGCGAGG TCTGGAAGTA CAGTCGCTAC
TTCGATAACA AACAATTCTG GAGCGATCCG AGCCAGCCCA AGGATGTCGT GACCCAAGTA
GAGAATAAGC TCATTGACCC GCCGGCCGGC ACCTACGATG TTAATGCGAC CCAGAGCTTC
AAATACGAGC CAGAGCCTGA TGAGTACGAA ATGTACAATG TTACCCAGGA TCCGATGGAA
CTCGATAATT TGTACGGCAA TCTCGTCTAT GCGGCGATGC AGACCCACTT GGCAACACTG
CTAGACCAGC AGCGCGCTCA AAAGCGTCTT ACACCGATCA GCGGCGTCGT TCCAGGTCAG
TAA
 
Protein sequence
MAKSNNLSRR QFLKTTGAVA MASSVAGFSD VLAVRDWSHP GRGLRGRPNI LLMLVDEMRY 
PPVFEGLGAQ QFRQTYLKTQ NALRASGVEF HRHYAAATAC APSRASIFTG HYPSLHGVTQ
TTGAAKEEND PDVFWLDPAS VPTMGDYFQA GGYRTFYKGK WHVSNADLQI PGTHDQLLSY
DDQGNPDPGK QQLYLEADRL ADYGFEGWIG PEPHGKAPLN TGSSPAQGQG RDVGFATQVV
NLIQQLGTER HSAPWLTVAS LVNPHDIALW GYVARHTGLF NFTVEDIVPA FTELFDPVMF
AQTLADDLTT KPSCQQSYQE SYNEWMQGVP PHDYFRFYYQ LHKNVDDELY KLYQALQQSP
FYDNTIVIFT SDHGDLLGAH RYMHQKWYQA YDEAVRVPLI ISNPHLFPEP RSIDSVTSHV
DLLPTLLSLA RLKQARLRRK VAKGHSDPVP LVGRNLRRLV LGRNRRPVAD PVYFMTDDDM
SRGLDQENFI GIAYGSVIQP SHVETVIVEI DGEVWKYSRY FDNKQFWSDP SQPKDVVTQV
ENKLIDPPAG TYDVNATQSF KYEPEPDEYE MYNVTQDPME LDNLYGNLVY AAMQTHLATL
LDQQRAQKRL TPISGVVPGQ