Gene Noc_0381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0381 
Symbol 
ID3706552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp422365 
End bp423633 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content61% 
IMG OID637736893 
ProductTPR repeat-containing protein 
Protein accessionYP_342437 
Protein GI77163912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCAG AAGCCGTCCC ATTACAGCTT ACCCGTATCC AGGCGCTCGA AGATGTGTTG 
CGGCAGCTCA AGGACGAGAA GCCTGAGTTG GTTGCCTCCT TGCAAGGCGA ATTAGGCAAT
GCGTTGGCAA CATCATCGAT GGGCTCCTCC CGGGCCTATA ATCTTGAGCA AGCCATCCAG
GCTTATCAGG CTGCCCTTGA GATCCGGACC CGAAACGATT TCCCCGAGCA GTGGGCCACG
ACCCAGCATA ACCTGGGCAA CGCCTATGGC AAGCGCATTC GGGGCGTGGG GGCGGAGAAC
CTGGAAAAAG CCATCCAGGC TTATCAGGCC GCCCTTGAGA TCCGGACCCG AAACGATTTC
CCCGAGCAGT GGGCCATGAC CCAGCATAAC CTGGGCAACG CCTATGGCAA GCGCATCCGG
GGTGCGCAGG CGGAGAATCT GGAGCGGGCC CTTGAAGCTT ATGAGGCCGC CCTTACAATC
TACACCCGCG ATGCTTTCCC CGAGGACTGG GCTATGACCC AGCATAGTCT GGGCAACGCC
TATCGAGATC GCATTCGGAG CGCGGGGGCG GAGAACTTGG AGCGGGCCCT TGAAGCCTAT
GGCGAGGCCG CTCGCATTTA CACGACCGAT ACGAATCCTG AAGCGGCGCG TCAGGTTCGA
CTCGGTCAAA GCGCAGCTTT ACTTAAGGCA GGCCGGTGGC AGGCGGCCCT GGATGCGAGT
GAGGAAGGGC TTGCGGCTTC CCGGATCTTG TTTGATATTA GCTTACATGA TCCGGAAGTT
CGGGAACGGG AGATTGGCAC ATCGGAAATG TTATATGCCC ACTGCGCGTT TGCCCAGGCG
CAACTGGGCA GACCTGATAA GGCGCTGGCG ATTCTAGAAG AAGGCCGGGC CCGGGAGCTG
CGTTACCGGG CAGGCCGGGA TCGGGCCGAT CTGGAAGACC TTCCAGAGCA GAGAAGGGCC
GCTTTTTTGC AAGCCGCGCA GCGGGTCCGC GATTTAGAAG CGGAATGGCG GCGGCCCGAG
GAGGAACGCC CGGCCGATCT AGCGGATAAA ACGCGCCAGG CCCGCCAGGG TCTGGAGCAG
GAAGCGCAAC AAATCCGCGT GTTAAAGCCG AATTTTCTGC GCGCGTCCGT GGATCTCCAT
GACATTCGCG CGGTGCTTCC CGGCCAAGAC GCCGCCCTGG TGGAATTTGC AGTCACCGAG
GCGGGCACCC TCGCCCTGGT GCTGCCGGCG GGGCAAGGGG CGTTGAAGCG GTCTGGATGG
AGAGCGTGA
 
Protein sequence
MPSEAVPLQL TRIQALEDVL RQLKDEKPEL VASLQGELGN ALATSSMGSS RAYNLEQAIQ 
AYQAALEIRT RNDFPEQWAT TQHNLGNAYG KRIRGVGAEN LEKAIQAYQA ALEIRTRNDF
PEQWAMTQHN LGNAYGKRIR GAQAENLERA LEAYEAALTI YTRDAFPEDW AMTQHSLGNA
YRDRIRSAGA ENLERALEAY GEAARIYTTD TNPEAARQVR LGQSAALLKA GRWQAALDAS
EEGLAASRIL FDISLHDPEV REREIGTSEM LYAHCAFAQA QLGRPDKALA ILEEGRAREL
RYRAGRDRAD LEDLPEQRRA AFLQAAQRVR DLEAEWRRPE EERPADLADK TRQARQGLEQ
EAQQIRVLKP NFLRASVDLH DIRAVLPGQD AALVEFAVTE AGTLALVLPA GQGALKRSGW
RA