Gene Noc_1344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1344 
Symbol 
ID3706147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1492417 
End bp1493937 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content51% 
IMG OID637737840 
Productextracellular solute-binding protein 
Protein accessionYP_343369 
Protein GI77164844 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.088373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGGAA CAAGAAACAT AGTCCGGTGG TGGCGAACGT TGGGTTTGTT GATTGGCATA 
GTTTTGCTCA TAGGCTGTAG TTCTGGTTAT GAGGACAGGA ATTCTTTACG TTTTGGCCTG
GCGACCTCAC CCCTTACCCT TGATCCCCGC TATGCCACGG ATGCCGCCTC AGCCCGTATT
ACCCGCCTCT TGTATCAACC CCTCGTTGAT TTGGATGCCA GCGGCAAGCC CATCCCCAGT
TTAGCCCATT GGCAACAACT GTCTCCGAGC CGCTACCGGT TTCGATTACA GCTTGATCGT
CCTCGCTTTC ACGATGGCAG CAAGCTTAGC GCTCAGGATG TGAAAGCGAC CTATACCTCG
GTACTTGATC CAGATAATGG CTCGCCTTTT TTAGCTTCCT TGGATAAAGT GCAGTCTATT
GCGGTGCCCA ATCCAGAAAC CATTGACTTC GTTTTGAAAG AACCTGATTC CCTGTTTCCA
GGGCGGCTTA CCCTGGGGAT AATCCCTGCT TCCCTAATAG CTGTGGGTCA CCCTTTGAAT
CGCATTCCGG TAGGCAATGG TCCCTTTCGC TTTAAGGCGT GGCCGGAAGA GGGCCGATTG
ATACTCCAGC GGCGACGAGA TAGACAGGGG TTTGTATTTG TTCATGTACC CGATCCCACG
GTGCGGGTAT TAAAATTACT CCGTGGGGAG ATCCACATGC TGCAAAACGA TCTTCCCCAG
GAGCTGGTGG CTTACCTGGA AGATAAGGAG GAAATTTCCC TGCAACGGGG TCGGGGGAAT
ACGTTTACGT ATCTAGGATT TAATCTCGAT GATCCGGTAA CAGGCCAACC TCAAGTGCGT
TTAGCGATCG CCCATGCTTT GGATCGTCAA AAGATCATTC GCTATGTTTT TAAAGGAGCA
GCCCGGCCAG CTCAGGCCCT GCTGCCGCCC GAGCATTGGG CAGGCCACCC GGCTTTACCC
CCGCACGCCT ATGATCCCCA GCGGGCCAGG AAACTGCTTA AGGAAGCTGG CTTCAATGCC
CATTCTCCGG CCCGCGTTAG CTATAAGACC TCTAGCGATC CTTTCCGGGT TCGCCTGGCC
ACCATTATTC AGCATCAACT GCAACAAGTG GGCATTCAGG TAGAGGTGCA AAGTTATGAT
TGGGGAACCT TTTACGGCGA TATTAAGGCC GGGCGTTTCC AAATGTATAG CCTATCCTGG
GTAGGCGTTA AATTGCCGGA TGCCTTTCAT TACATCTTTC ATAGCGAGTC GGCTCCACCT
CAAGGGGCGA ACCGAGGCCA TTTTAACGAT CCTCAAAGTG ATTTTTTAAT TGAACGGGCG
GGAAATACAG ACAATCAAAA CCAGCAAGCC AATCTCTACC GCCGGTTACA AGCACGCCTT
CTGGAACAAC TGCCTTATGT GCCTCTTTGG TACGAAGATA ACGTGTTCGC AGCCCATCAA
GGTATCAGTC ATTATCAACT AGCACCCGAT GGGAATTATG ATGGCTTGGT GGAAGTGCGG
TTCCATCGGT ACGTGAAGTA A
 
Protein sequence
MRGTRNIVRW WRTLGLLIGI VLLIGCSSGY EDRNSLRFGL ATSPLTLDPR YATDAASARI 
TRLLYQPLVD LDASGKPIPS LAHWQQLSPS RYRFRLQLDR PRFHDGSKLS AQDVKATYTS
VLDPDNGSPF LASLDKVQSI AVPNPETIDF VLKEPDSLFP GRLTLGIIPA SLIAVGHPLN
RIPVGNGPFR FKAWPEEGRL ILQRRRDRQG FVFVHVPDPT VRVLKLLRGE IHMLQNDLPQ
ELVAYLEDKE EISLQRGRGN TFTYLGFNLD DPVTGQPQVR LAIAHALDRQ KIIRYVFKGA
ARPAQALLPP EHWAGHPALP PHAYDPQRAR KLLKEAGFNA HSPARVSYKT SSDPFRVRLA
TIIQHQLQQV GIQVEVQSYD WGTFYGDIKA GRFQMYSLSW VGVKLPDAFH YIFHSESAPP
QGANRGHFND PQSDFLIERA GNTDNQNQQA NLYRRLQARL LEQLPYVPLW YEDNVFAAHQ
GISHYQLAPD GNYDGLVEVR FHRYVK