Gene Noc_1493 details


Gene Information

Locus tag: Noc_1493
Symbol:
ID: 3705984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1653266
End bp: 1655080
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 55%
IMG OID: 637737980
Product: peptidase M61
Protein accession: YP_343509
Protein GI: 77164984
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
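
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1653266
END_BP = 1655080


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1815
print(protein_length(length_bp))  # 604
```

Both results match the Gene Length (1815 bp) and Protein Length (604 aa) fields.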


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.333369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTTA TTCATTATCG CATTATTCCC AAAAATCCCC AGGCCCACCT GTTCCAGGTG 
ACGCTTACTA TTTTAGAGCC TGCTCCCGAA GGCCAGTGTT TGTCGCTACC GGCATGGATT
CCTGGGAGCT ATTTGATCCG CGATTTTGCC AAGCACGTGG TCCAGCTTAG GGCCGAATCC
CAAGGGCATC CCCTTCCCGT CGAAAAATTG AGCAAATCTC TCTGGCAATG CGCGCCGAGC
GGGGGGCCAG TGACAATTAA CTATGAGGTC TATGCCTGGG ACAGCTCTGT GCGGGCCGCC
CATCTGGACA CCTTCCATGG TTTTTTCAAC GGAACCAGCG TTTTTCTGCG GGTAGAAGGG
CAGGCTCATG AACCCTGCAC CGTCACCCTT TTACCGCCAG AGGGGGAAAC TTACCGCCAC
TGGCGGGTGG CTACCGCTTT ACCTCGGGCT GGCGCCGAGC CGTATGGTTT TGGCGACTAT
CAGGCGGGGG ACTATGAGGA ACTGGTTGAT CATCCCGTGG AAATGGGTGA GTTTAGCCTG
GTTACCTTCG AGGCTTGCGG CGTTCCCCAC GATATCGCCA TCACCGGCCG TCATCGGGCT
GATATGGAAC GTCTGAGCCG GGATCTGAAG ATCTTATGCG AGCACCATAT CCGCTTCTTC
GGTGAACCAG CGCCTATGGA TCGGTATGTC TTTTTGGTGA CCGCCGTGGG CGAAGGATAT
GGGGGACTGG AACATCGAGC TTCGTGTGCT TTGTTATGCA ATCGGAGCGA CTTGCCCCAA
GCAGGAGAAA CCGAAGTTGG GGAAGGTTAC CGGAATTTTC TTGGCCTATG CAGTCATGAA
TATTTCCATA CCTGGAATAT CAAGCGTATC AAGCCCGCCG CTTTTGTCCC TTACGATCTT
CAGCAAGAGA ATTACACCCG CCTGCTTTGG GCTTTTGAAG GGATTACCTC CTATTATGAC
GATCTGGGGC TGGTGCGCTC GGGGCTGATT AGCCAAGAGA GTTATTTGGA GTTGCTGGGA
CAGACCATCA CCCGGGTGTT GCGAGGCTCG GGCCGGCTTA AGCAAAATCT AGCCGAATCC
AGCTTTGATG CCTGGACGAA ATTTTACCAG CAGGATGAAA ATGCTCCCAA TGCTATCGTG
AGTTACTATA CCAAGGGTGC ATTGGTAGCC CTGGCGCTGG ATCTAACCCT GCGGTGGGAA
ACCCACGGCG AGTGTTCCTT GGATGGGGTG ATGCGGGCAT TATGGGAAAG TTATGGTAAG
ACGGGGGTAG GCGTGCCTGA AGACGGGGTA GAACGTCAGG TTGCCGAGGT TTCAGGATTG
GATATAACGG ATTTCTTTGA AGTAGCGCTG CGGCGGGCGG AGGATCTTCC CTTGCAAGCG
CTGTTGGCCC AGGTAGGGAT TTGCTATGCC CTGCGCGTCC CGGAATCCAG TGATGATAAA
GGCGGCAAGC CTGGCAAAGG GGGCACGCCT CGCGCTAGGC TAGGCATCCG CTTGGTTCCG
AATGAAAAAG AGGCCAGGAT CAGCCAAGTC TTCGATGAGA GCGCGGCTCA ATGGGCCGGG
CTCTCAGCCG GGGATAGCCT CATTGCCGTA GATGGTATCA GGGTGACCGC CAGCAATTTG
GAGAAAGTCA TTAGCTCTTA TCCAGGAGGA GCGAGAGTTA TCATCCATGC TTTTCGGCGG
GATGAATTGC GGGAGTTTGA GGCTGCTTTG CAACTGCCTC CCAAGGATAC CTGCGTGCTG
ACTATTGATG AAGACGCCTC CCCAACCGCT GTAGTGGCCC GGGAGGCTTG GTTATTGGAG
TTTCGCCATG ACTGA
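
The sequence above is printed in space-separated blocks of ten bases, so it has to be joined before it can be measured. A minimal sketch of how the GC content field could be recomputed from it follows; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block to check
# the record's GC content value.
first_line = "ATGTCGTTTA TTCATTATCG CATTATTCCC AAAAATCCCC AGGCCCACCT GTTCCAGGTG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```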
 
Protein sequence
MSFIHYRIIP KNPQAHLFQV TLTILEPAPE GQCLSLPAWI PGSYLIRDFA KHVVQLRAES 
QGHPLPVEKL SKSLWQCAPS GGPVTINYEV YAWDSSVRAA HLDTFHGFFN GTSVFLRVEG
QAHEPCTVTL LPPEGETYRH WRVATALPRA GAEPYGFGDY QAGDYEELVD HPVEMGEFSL
VTFEACGVPH DIAITGRHRA DMERLSRDLK ILCEHHIRFF GEPAPMDRYV FLVTAVGEGY
GGLEHRASCA LLCNRSDLPQ AGETEVGEGY RNFLGLCSHE YFHTWNIKRI KPAAFVPYDL
QQENYTRLLW AFEGITSYYD DLGLVRSGLI SQESYLELLG QTITRVLRGS GRLKQNLAES
SFDAWTKFYQ QDENAPNAIV SYYTKGALVA LALDLTLRWE THGECSLDGV MRALWESYGK
TGVGVPEDGV ERQVAEVSGL DITDFFEVAL RRAEDLPLQA LLAQVGICYA LRVPESSDDK
GGKPGKGGTP RARLGIRLVP NEKEARISQV FDESAAQWAG LSAGDSLIAV DGIRVTASNL
EKVISSYPGG ARVIIHAFRR DELREFEAAL QLPPKDTCVL TIDEDASPTA VVAREAWLLE
FRHD
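
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, assuming the elongation codon assignments of translation table 11, which are the same as those of the standard genetic code; the `translate` helper is hypothetical and does not handle alternative start codons or ambiguous bases.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string; whitespace is ignored, '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and the last line of the gene sequence.
print(translate("ATGTCGTTTA TTCATTATCG CATTATTCCC"))  # MSFIHYRIIP
print(translate("TTTCGCCATG ACTGA"))                  # FRHD*
```

The two outputs agree with the first ten and last four residues of the protein sequence, with the final TGA codon acting as the stop.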