Gene Noc_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1971 
Symbol 
ID3705430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2260814 
End bp2262175 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content50% 
IMG OID637738447 
Productcapsular polysaccharide biosynthesis protein CapK 
Protein accessionYP_343963 
Protein GI77165438 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGACTC TTGCCACCCA TTTTATTTCA AGATTTATTT TTCCACTCCA CGAGTACATT 
AAGGGGCATG GGACGGTTCG GATTTTACGT CAACTTGAAA CCAGCCAGTG GTGGTCGGCA
GATCGCCTGG CGGAATTTCA GGCAGCACGC CTGCGAAGAT TAATTAAGTT TGCTTCTCAA
CAAGTGCCTT ACTATCGGAA ACTGTTTGCA GAAGCGGGGA TCGGTCCCAA TGACATTCGT
TCTCCAGAGG ATTTGCCAAA GCTGCCGTTT CTTACTAAGG ATCTAATACG GGCGAATATG
GATGTTCTCA AAGCGGACAA TGCTAGCTCT CTTGTACGCT TTAATACGGG TGGTTCTACA
GGTGAGCCTT TGATTTTCTA TTTAGGTAAA GAGCGGGTAG GTCACGATGT GGCTGCTAAA
TGGCGGGCAA CCCGCTGGTG GGGAGTCGAT ATCGGCGATC CTGAGGTAGT GGTGTGGGGG
TCGCCGGTAG AGCTTGGGGC TCAGAACCGT TTGCGAGAGT GGCGTGATCA TCTTTTGCGG
ACTCGTCTAT TGTCAGCATT TGATATGTCC GAGCGCAACC TCGACTATTT TATCGCTGAG
ATCCGGCGCT TGCGCCCAAA GATGTTATTT GGCTATCCAA GTGCCATTGC TCACATTGCA
CGTCATGTGC AAAAGCGGGG CGGGTCCATG AACGATCTTG GGGTACAGGT TGCCTTTGTT
ACTGCGGAAC GCCTCTATGA GGATCAATAT AGTGATATCG AGTCTACCTT TGGATGTCGA
GTCGCTAACG GCTATGGCGG GCGTGATGCA GGTTTTGTTG CCCATCAGTG CCCGGAGGGC
AGCTTACACA TTACGGCAGA GGACATTGTG GTGGAATTGG TTGATCAAGC GGGGAAATTG
GTAGCGGCAG GGGAGCCCGG TGAGGTAGTG ATTACTCATC TTGCTACTAA AGAATTTCCC
TTTATTCGCT ATCGCACTGG TGATGTAGCG GTTTTTGACG ACCAGTTGTG TGGTTGTGGG
CGCTCCCTAC CAGTACTTAA AGAAATCCAG GGACGAACGA CAGACTTTAT CGTAGCAGCG
GATGGCACGG TAATGCATGG GCTTGCGCTG ATTTATGTCT TGCGCGATTT GCCGGGAATC
GAATCGTTCA AGATCATCCA AGAAAGTAAG ACCGAAATGC GGGTATTGTT GGTGCCTGTT
AATGGTGGGT ATAGCCGTGC CGTTGAAGAT GGCATTCGGC GGGATTTTCA AGATCGGCTG
GGGAAGGGAG TCCGTGTGAA TGTTGAACCA GTTGCAGAGA TACCGCCTGA ACGATCAGGG
AAATTTCGTT ATGTGGTCAG CCATGTGACT TTACATTCAT AA
 
Protein sequence
MTTLATHFIS RFIFPLHEYI KGHGTVRILR QLETSQWWSA DRLAEFQAAR LRRLIKFASQ 
QVPYYRKLFA EAGIGPNDIR SPEDLPKLPF LTKDLIRANM DVLKADNASS LVRFNTGGST
GEPLIFYLGK ERVGHDVAAK WRATRWWGVD IGDPEVVVWG SPVELGAQNR LREWRDHLLR
TRLLSAFDMS ERNLDYFIAE IRRLRPKMLF GYPSAIAHIA RHVQKRGGSM NDLGVQVAFV
TAERLYEDQY SDIESTFGCR VANGYGGRDA GFVAHQCPEG SLHITAEDIV VELVDQAGKL
VAAGEPGEVV ITHLATKEFP FIRYRTGDVA VFDDQLCGCG RSLPVLKEIQ GRTTDFIVAA
DGTVMHGLAL IYVLRDLPGI ESFKIIQESK TEMRVLLVPV NGGYSRAVED GIRRDFQDRL
GKGVRVNVEP VAEIPPERSG KFRYVVSHVT LHS