Gene Noc_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1786 
Symbol 
ID3704901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2014536 
End bp2015687 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content55% 
IMG OID637738270 
Producthypothetical protein 
Protein accessionYP_343787 
Protein GI77165262 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCAT TAACGCAAGA TTACCCAGCT ATTTTGCTGG ACAAGCACGA GCCGCCTTGT 
CTCTCGCTCT ATCAGCCGAC GCACCGACAG CATCCTGAAA ACACCCAGGA CCCGATCCGC
TTCCGCAATC TGGTGAAGGA AATGGAGGAA TCCCTGCGCA AGAAGTATCC GTCTCGTGAT
ATTACGGCGC TACTGCGCCC ATTTGAGGAC TTAGCCGAAG ATCGGTCTTT CTGGAATCAT
ACCGCAGACG GGCTGGCGGT GCTGAGTGCG CCTGGGCTGT TCCGCGTGTA TAGACTGCAG
CGGCCAGTTG TCGAGCTATC GGTTGTCGCC GACAGCTTTC ATACCAAGCC GCTCATGCGC
ATTGTGCAAT CGGCCGATCG CTACCAGATT CTTGGTTTGA GCCGGCACGC GTTCAAGATG
TTCGAGGGCA ATCGCGATGC GCTGGATGAG ATTCAGCTCA TTGAAAGCGC GGCGCAAGTG
ATCGATGAAC AACAAGGCAA AGATGAGGGT GATCGTGAAG GCGCTCACCG CGCATACAGC
TCTGCTGGGC GGCCCGGTGC CGCAGCGCGG CACGGCACCG ATGTAAAGCA AGATGTTGAG
GATCGCAACA CCCAACTATT TTTTCGCGCT GTGGATGAAG TAGTGCTGAA GCATTATTCC
CAACCATTGG GACAGCGCCT AATACTGGCG GCGCTACCTC AGCACCATCA TCTGTTTCGC
GCCATTAGCA GTAATCCTTT GCTAATGAGC GAAGGTATTA ACACTAACCC AGAGGCGTTA
TCGCTTGATG CGTTGCGTGA ACGCGCATGG CAATTGGTGC AGCCCTATTA CCTTGAGCGG
CTAGCCGGTT TGGTGGAATC GTTTGGCGCG GCGGCTGCAA AGGGGCAGGG CGCGGATGAC
CTTAGCGAGA TCGCCACGGC CGCCATCGCC GGGCGGATTG CAACGTTATT GATCGAGGCC
GACCGTTTAA TTCCCGGTTA TATTGATGCC ACAAGCGGTC AAATTACCAC TGATAATTTG
AGTAACCCGG AAATTGACGA TGTGCTCGAT GATCTCGGCG AGCATGTGCT CAAGACCAGC
GGCGAAGTTG TGATCGTCCC CAGTGAACGA ATGCCAACAC AGACTGGCGC CGCCGCCATC
TATCGTTTTT GA
 
Protein sequence
MNSLTQDYPA ILLDKHEPPC LSLYQPTHRQ HPENTQDPIR FRNLVKEMEE SLRKKYPSRD 
ITALLRPFED LAEDRSFWNH TADGLAVLSA PGLFRVYRLQ RPVVELSVVA DSFHTKPLMR
IVQSADRYQI LGLSRHAFKM FEGNRDALDE IQLIESAAQV IDEQQGKDEG DREGAHRAYS
SAGRPGAAAR HGTDVKQDVE DRNTQLFFRA VDEVVLKHYS QPLGQRLILA ALPQHHHLFR
AISSNPLLMS EGINTNPEAL SLDALRERAW QLVQPYYLER LAGLVESFGA AAAKGQGADD
LSEIATAAIA GRIATLLIEA DRLIPGYIDA TSGQITTDNL SNPEIDDVLD DLGEHVLKTS
GEVVIVPSER MPTQTGAAAI YRF