Gene Noc_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1978 
Symbol 
ID3705437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2271786 
End bp2273318 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content51% 
IMG OID637738454 
Producthypothetical protein 
Protein accessionYP_343970 
Protein GI77165445 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTAAGTT GGCCTCTGGC AACGGTGGCA ATAGCTGTGG TGTTGCTTGC CCTATTCCTG 
TTTTATCTGG AAACCGCAGT ATCCATGGCG GCTATTTGGT GGCGTTCTGC GACCTTTGCC
CATGGTATGC TCATTTTTCC GGTTAGCGGT TATATGATCT GGGCGCGACG GTGGCAGCTC
CAGCAGCTTC AACCCCATCC TCGGCCACTA GCCGCATTCT TTATTCTGCT ATTATCCGGG
GGATGGTTAC TGGCTAGGAT TGCGGACGTG CTATTTGTCG AACAATTGTT ATTGGTGGCC
ATGATTCCGG TAGTGGTGTG GGGGTTATTG GGAAAGAGGG TCGTTCGCGC GTTGGCGTTT
CCACTTGTCT ACTTGGTTTT TGCGGTACCG TTTGGTGAAT TTTTAATTCC GCCGCTGCAA
GATTTTACAG CCGCTTTTGC TGTCAAAAGC CTGCAATTCG GTGGAGTGCC GGTTTACTGG
GAAGGGCGTT ATATTAGTAT TCCCTCGGGG GATTTTCTGG TTGCGGAAGC TTGTAGCGGG
CTGCGTTACC TTATTACTTC GGTTGTTCTT GGCACCCTCT ACGCTTATCT TACTTATTCT
AGTTATGGGC GCCGCGCGGC ATTTATTGTG GCTTCGGTTA TCGTACCTAT TATTGCTAAT
GGTATCCGGG CCTACGGCGT TATTATGCTG GCTTACCTCA GTAATATGAA GTTGGCGACC
GGCGTGGACC ATGTTGTTTA CGGGTGGATT TTCTTTGGCG TAGTCATGTC GTTACTCTTT
TGGCTGGGAT CGTTCTGGCG CGAGGATAAG TGCCCCGGCA ACAGGGCCTC GTTGTCTCGG
CAATCTGGAG GGATTGTCGC CCAGCAGGCC CCCCGTGCAA AAAAACTGGG TGTAACTACT
ATTATACTGA TCCTGGTGGC CGGTGCCGGT CCCGCGAGTG GAATTTGGTT TAAAGGCCAA
GCATCGAAAA CCGACTGTTC GGTGTCTATG CCTAAGGAGC AGCCGGTCTG GAGTGGTTCT
TCCGTCCCGA CCAGTATGTG GGAGCCAGAT TATTCCCAAG CAGATCAAAT CGTGCGCCGT
CTCTATTCAT TCCCAGATGA TAGTGCGGTA CAGCTTTTAA TTATCTATTA TCAGCAAGAG
CATCAAGGAG CTGAACTTAT TAGTTCTCAA AATCGCCTCT ATGATGATCA AATTTGGCGT
TGGATGGAAG ATAATCGCCG AAGCCTCTCC CTTGGTGATG ATCATCTCCA AGTGCATGAA
ACTGTTATCC GTTCCCCTAA TACGCTGCGC GTCATTTGGC ACTGGTATGA TATTGCTGGC
CAGCGAACGG CTAGTCCGAT AAAAGCTAAA TTTCTTGAAG CCTGGGCTCA TCTTACCAAG
CAGCCAAGCG GCTCCACGCT CATCGCTGTA GCTGCGGATA GCGGTAAGCC GGAGCAAGCC
CGTGCGCTCC TGCTAAAATT TTTGAATGAG ATGCCGGCAG TCTCTACCTC AGGCGCAATG
CTAGCTTGCC AGTCAGTATC AGAGAGAACA TAA
 
Protein sequence
MLSWPLATVA IAVVLLALFL FYLETAVSMA AIWWRSATFA HGMLIFPVSG YMIWARRWQL 
QQLQPHPRPL AAFFILLLSG GWLLARIADV LFVEQLLLVA MIPVVVWGLL GKRVVRALAF
PLVYLVFAVP FGEFLIPPLQ DFTAAFAVKS LQFGGVPVYW EGRYISIPSG DFLVAEACSG
LRYLITSVVL GTLYAYLTYS SYGRRAAFIV ASVIVPIIAN GIRAYGVIML AYLSNMKLAT
GVDHVVYGWI FFGVVMSLLF WLGSFWREDK CPGNRASLSR QSGGIVAQQA PRAKKLGVTT
IILILVAGAG PASGIWFKGQ ASKTDCSVSM PKEQPVWSGS SVPTSMWEPD YSQADQIVRR
LYSFPDDSAV QLLIIYYQQE HQGAELISSQ NRLYDDQIWR WMEDNRRSLS LGDDHLQVHE
TVIRSPNTLR VIWHWYDIAG QRTASPIKAK FLEAWAHLTK QPSGSTLIAV AADSGKPEQA
RALLLKFLNE MPAVSTSGAM LACQSVSERT