Gene Noc_1687 details

Gene Information

Locus tag: Noc_1687
Symbol:
ID: 3705600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1888053
End bp: 1889726
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 49%
IMG OID: 637738168
Product: putative ABC transporter ATP-binding protein
Protein accession: YP_343689
Protein GI: 77165164
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
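
The coordinates, gene length, and protein length in this record are mutually consistent, which the short Python sketch below checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record itself states neither convention. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1888053
end_bp = 1889726
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, 1889726 - 1888053 + 1 = 1674 bp, and 1674 / 3 = 558 codons, one of which is the stop codon, leaving 557 residues.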


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00296418
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCAGT ATGTTTATAG TATGAACCGG GTGGGCAAGG TAGTGCCGCC GAAACGCGTT 
ATTTTGCGCG ATATCTCCCT ATCCTTTTTT CCAGGTGCAA AGATTGGCGT GCTAGGTCTG
AATGGGGCAG GAAAATCAAC CGTGCTCAAA ATTATGGCCG GAATTGACAA AGATATTGAA
GGCGAAGCTA TTCCCCAAAA AGGACTTAAG ATCGGCTACC TTTCTCAAGA GCCGCATCTG
GATCCTGCTA AAAATGTCCG GGATAACGTG GAGGAGGGAA TTGCCGAAAC TAAAGCTATG
TTGGAGCGAT TCAACGAGAT TGGCCTGTTA TTTGCCGAGC CAATGAGTGA TGAAGAAATG
AATCGCCTCT TTGAGGAACA GGCACAGCTT CAAGACGCTA TTGAAGCTGC GGATGCCTGG
AATCTAGATC ATAAGCTTGA TATCGCTGCC GAAGCGTTGC GCCTCCCTCC TTGGGAGGCA
GAGGTCACCC ATCTTTCGGG CGGGGAACAG CGGCGTGTGT CCCTTTGCCG CTTGCTCCTT
TCTGAGCCAG ATATGTTGCT GCTAGACGAA CCTACTAATC ATCTTGATGC AGAGTCGGTT
GCTTGGTTGG AGCGCTACTT GGAAAAATAT CCGGGTACTG TCGTAGCTGT AACCCATGAT
CGCTATTTCT TGGATAATGT GGCCGGCTGG ATTCTGGAAT TAGATCGCGG CCACGGTATT
CCTTGGGAGG GAAATTACTC ATCTTGGCTG GAACAAAAAG AAAAACGCTT GCAATTAGAG
GAGAAGCAGG AGGGGGCCCG GATTAAAGCA ATAAAGGCGG AGCTCGAATG GGTTTCCGTA
AACCCAAAAG GGCGGCATGC CAAGAGCAAA GCCCGTCTCG CCCGTTTTGA AGAATTATCT
TCCCAAGAAT ACCAAAAACG CAACGAAACT AATGAAATCT ATATTCCACC AGGTCCACGT
CTGGGGGATA TAGTGATTGA AGCAAAGGAT CTGCGCAAGA GCTTTGGTGA CCGTTTACTT
ATTGATGAGC TAAATTTCAG CCTTCCTCCT GGGGGGATTG TGGGAATCAT CGGTGCCAAT
GGCGCGGGAA AAACAACTTT GTTTAGAATG ATGGTGGGCC AAGAGCAGCC GGATGCGGGT
GAAATTCGAC TAGGGGATAC AGTCAAGTTG GCTTATGTCG ATCAAGGCCG GGAGGCTTTA
AATGCTAGCA AAACCGTGTG GGAAGAGATT TCAGAAGGTC AAGACATTAT CAAGGTTGGG
GCTTATGAGA CTCCTTCCCG CGCCTACGTA GCGCGATTTA ATTTTAAAGG TTCAGATCAG
CAAAAACGTA TTGGAGATCT TTCTGGCGGT GAGCGTAATC GGGTGCATCT AGCTAAGTTG
CTTCGTGCTG GAGGAAATGT TCTTCTCCTT GATGAACCGA CCAATGACTT AGATGTGGAA
ACCCTAAGAG CCTTGGAGCA GGCTCTACTA GGTTTCCCCG GCTGTGCCGT AGTGATTTCC
CATGATCGTT GGTTTTTAGA TCGTATTGCT ACCCATATTC TCGCTTTTGA AGGGGATAGC
CAAGTTATTT GGTTCGAAGG CAACCATGCC GATTATGAAG CGAACCGTCG CCAGCGCCTG
GGTGAGATGG CCGATCAGCC TCATCGTATC CGTTACCAAC CCTTGTTTTC ATAG
 
Protein sequence
MAQYVYSMNR VGKVVPPKRV ILRDISLSFF PGAKIGVLGL NGAGKSTVLK IMAGIDKDIE 
GEAIPQKGLK IGYLSQEPHL DPAKNVRDNV EEGIAETKAM LERFNEIGLL FAEPMSDEEM
NRLFEEQAQL QDAIEAADAW NLDHKLDIAA EALRLPPWEA EVTHLSGGEQ RRVSLCRLLL
SEPDMLLLDE PTNHLDAESV AWLERYLEKY PGTVVAVTHD RYFLDNVAGW ILELDRGHGI
PWEGNYSSWL EQKEKRLQLE EKQEGARIKA IKAELEWVSV NPKGRHAKSK ARLARFEELS
SQEYQKRNET NEIYIPPGPR LGDIVIEAKD LRKSFGDRLL IDELNFSLPP GGIVGIIGAN
GAGKTTLFRM MVGQEQPDAG EIRLGDTVKL AYVDQGREAL NASKTVWEEI SEGQDIIKVG
AYETPSRAYV ARFNFKGSDQ QKRIGDLSGG ERNRVHLAKL LRAGGNVLLL DEPTNDLDVE
TLRALEQALL GFPGCAVVIS HDRWFLDRIA THILAFEGDS QVIWFEGNHA DYEANRRQRL
GEMADQPHRI RYQPLFS
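
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below is one minimal way to do that, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in their permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the whitespace stripping assumes the 10-base block layout shown above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, as displayed above.
first_blocks = "ATGGCGCAGT ATGTTTATAG TATGAACCGG"
print(translate(first_blocks))  # MAQYVYSMNR
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_percent` on the same input can be compared against the GC content field.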