Gene Noc_1851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1851 
Symbol 
ID3705115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2102755 
End bp2104470 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content57% 
IMG OID637738331 
ProductTPR repeat-containing protein 
Protein accessionYP_343848 
Protein GI77165323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATC TAGGGTTAAA AATAGGGTTA TTGAGTCTCT TTCTATCATT ATTGGCAGGA 
TGCGAACAGC GGGTACCCGT CGTTGCCAGC GAAGGCCAGC AGAGCAGCTT TGCCGCTGAT
TCGGCCACGA CGGCGAGCGA AGATGCGCCA CCGGGAGATA TTGATCCCCT GTTGGATAAT
CTGGGCGATC ATCATCATCC CGTTACTACC TCTTCTTCCC TGGCTCAGCG TTATTTCGAT
CAGGGCTTAA CCCTTGCGTT TGCCTTCAAT CATGCCGAGG CTATCCGCTC CTTTAAGGAC
GCCGCCACAA TCGATCCGGA TTGCGCTATG TGCTATTGGG GGGTTGCCCT TGCGCTCGGA
CCTAATATTA ATGCGCCCAT GGAGGCGGCG GCTGTCCCGC AGGCTTATGA GGCCGTTCAG
AAAGCGCTGG CGCTGGCGCC TAAGGCCAAT AAAGCGGAGC AAGCCTATAT CCAAGCGCTG
GCGATACGTT ATGGGCCTAC CTCTGGAGCT GATCGGGAAG GACTGGATCG GGCCTATGCC
GATGCCATGA GGGAGCTGTC GCGCCGTTAC CCCGATGACT TGGATGGGGC AGTGATTTTT
GCCGAGGCCC TGATGAATCT CACGCCATGG GAGTACTGGA CTCCCGCAGG GGAGCCGACG
GCCCATACCC AGGAAATCAT AGCCACCCTG GAGTCGGTAT TAGAGCGCGA CCCCAATCAT
ATTGGGGCTA ATCATTATTA TATTCATGCT GTCGAGGCAT CCCCGGCGCC GGAGCGGGCC
CTTCCTAGCG CCAAGCGGTT AGGGCAGCTA GCGCCGGGCG CTGGTCATCT GGTTCATATG
CCTGCCCATA TTTACTGGCG GGTGGGGGAT TACCATGCAG CGGTGACCGC CAATGAACAT
GCTATCCATA CGGATGAAGA ATATCTCCCC GACCCAGATG CCGAGGGGTT ATACCGGCTC
GGTTATTACC CCCATAACAT CCATTTCCTA TTTGCCGCGG CGCAGATGGA AGGCAACAGC
CAGCTGGCCT TGGAAGCGGC CCGCAAACTA GTGGCCAGTA TTCCCGAAGA ATCCTACTCG
ACCTTACCTC AATTGGAGGA ATTTAGGCCC ATGCCTCTCT ACGCCTTGGT GCGGTTCGGG
AAATGGGATG AAATTCTCCG TGAACCCAAG CCCGGTGCCT TCTTTCGATA CACGCGGGGA
ATCTGGCATT GGGCCCGGGG CATGGCGCTC ACGCGTCTGG GTCAGCTTGA TTCCGCAGCC
CAGGAATATG AACAATTAAC CAAAATCGGG CAGTCCCAGG CCATGGCTCA ACTAGTTTTT
TGGTCTGCTT CTTCCGGCTC AACCTTGCTA GAGATTGCCG CCCATATTCT CGCCGGAGAA
TTAGCCGGCG CCCGGGGCCA GACCGAAGCA ATGATCGCCC CCCTCAGGGA AGCGGTGGGC
ATTCAGGATA ATCTGCGGTA TATCGAACCG CCGGCTTGGT ATTACCCGGT GCGCCATAAT
CTGGGCGCGG CGTTGCTGAA GGCGGATCGG GCCGTAGAAG CCGAAGCCGT TTACCGGAAA
GATTTAAAGC AATATCCCCA AAATGGCTGG TCGCTCTTTG GCTTGGCCCA AAGCCTCCGT
GAGCAAGGCC AAACCGAAGC GGCGGCAACG GTGGAAAAGC GCTTTGAGGA GGCTTGGCAG
CACGCCGATG TGGACTTGAG GGCTTCCCGC TTTTGA
 
Protein sequence
MSYLGLKIGL LSLFLSLLAG CEQRVPVVAS EGQQSSFAAD SATTASEDAP PGDIDPLLDN 
LGDHHHPVTT SSSLAQRYFD QGLTLAFAFN HAEAIRSFKD AATIDPDCAM CYWGVALALG
PNINAPMEAA AVPQAYEAVQ KALALAPKAN KAEQAYIQAL AIRYGPTSGA DREGLDRAYA
DAMRELSRRY PDDLDGAVIF AEALMNLTPW EYWTPAGEPT AHTQEIIATL ESVLERDPNH
IGANHYYIHA VEASPAPERA LPSAKRLGQL APGAGHLVHM PAHIYWRVGD YHAAVTANEH
AIHTDEEYLP DPDAEGLYRL GYYPHNIHFL FAAAQMEGNS QLALEAARKL VASIPEESYS
TLPQLEEFRP MPLYALVRFG KWDEILREPK PGAFFRYTRG IWHWARGMAL TRLGQLDSAA
QEYEQLTKIG QSQAMAQLVF WSASSGSTLL EIAAHILAGE LAGARGQTEA MIAPLREAVG
IQDNLRYIEP PAWYYPVRHN LGAALLKADR AVEAEAVYRK DLKQYPQNGW SLFGLAQSLR
EQGQTEAAAT VEKRFEEAWQ HADVDLRASR F