Gene Noc_3040 details

Gene Information

Locus tag: Noc_3040
Symbol:
ID: 3704339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3436182
End bp: 3437210
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 50%
IMG OID: 637739514
Product: cytochrome oxidase assembly
Protein accession: YP_345011
Protein GI: 77166486
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1612] Uncharacterized protein required for cytochrome oxidase assembly
TIGRFAM ID:
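
The listed coordinates and lengths can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the Start bp and End bp values are 1-based and inclusive (an assumption, although it does reproduce the listed gene length). The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 3436182
end_bp = 3437210

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is a stop codon
# and does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1029, matching "Gene Length"
print(protein_length)  # 342, matching "Protein Length"
```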


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00748587
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGCC GGTTTTATGT TTTTGCTTTA GCAGCTAGCC TATTAGCCCT GGTCGTTGTG 
GTGGTAGGCG CTTATGTGCG CCTTTCTGAT GCTGGATTAA GCTGCCCGGA TTGGCCAGGT
TGTTACCAAA AGCTATTAGC ACCTACCACT GAGCAGCAGG TTGATCATGC TAATCGCCTC
TATCCGGATC GCCCGGTGGA GACAGCTAAA GCCTGGAAAG AAATGATCCA TCGCTACCTC
GCGGGTATCT TGGGATTATT GATTTTAGGG CTTGCTATTG CCGCCTGGCG TAACCGTTCT
GATCCCACTC AGAAAGTGGC TCTGCCCTTA TTTTTACTTG GATTAGTGGG GTTGCAAGCT
GCGTTGGGGA TGTGGACCGT TACTCTCTTG GTGCAGCCGG CTATTGTGAC GTTACATCTT
CTAGGAGGAA TGGCGGTTCT AGCCTTGGTT TGGTGGCTAG CATTGCGGCA GCGACAGGCA
CGACGCCCTA TGGAGAAAAT CTGGTATTCA CCAGCTTTTA AGCTTTTGGC ATTAATAGGC
TTATTTCTGC TAGTACTGCA AATCATCCTT GGAGGTTGGA CAAGCACCAA TTATGCGGGC
TTCTATTGTT CGGATTTTCC CACCTGCCAA GGGCAGTGGT GGCCAACCAT GGATTTTCGT
GAGGCTTTCA CATTTTGGCA GCCGCTAGGG GAAAATTATG AAGGTGGACG GTTAGCGCCG
GAGGCAGCAG TGGCTATTCA TGTTATCCAT CGGATTGGCG CCGTAGTGGT TTTGATAGTG
CTAAGTGCTC TTGGTATACG GGCAGGGTTA GGCCGAGGCA CTCCCGCGTT ACGCAGCGTC
GGGTGGATAG TTGTTATGTT AGTCCTTATC CAGGCAGCGC TAGGCATTGC CACCGCTATG
GGAGGAATTC CATTAGCGCT GGCGGTAGCG CATAACGCTG TAGCTGCATT ATTGTTACTT
GCCGTCGTTA CTTTGAATCA TTTGCTCCAT CCTACAGGGT ATCCATTACA AGGAGCTACA
AGACTATGA
 
Protein sequence
MSRRFYVFAL AASLLALVVV VVGAYVRLSD AGLSCPDWPG CYQKLLAPTT EQQVDHANRL 
YPDRPVETAK AWKEMIHRYL AGILGLLILG LAIAAWRNRS DPTQKVALPL FLLGLVGLQA
ALGMWTVTLL VQPAIVTLHL LGGMAVLALV WWLALRQRQA RRPMEKIWYS PAFKLLALIG
LFLLVLQIIL GGWTSTNYAG FYCSDFPTCQ GQWWPTMDFR EAFTFWQPLG ENYEGGRLAP
EAAVAIHVIH RIGAVVVLIV LSALGIRAGL GRGTPALRSV GWIVVMLVLI QAALGIATAM
GGIPLALAVA HNAVAALLLL AVVTLNHLLH PTGYPLQGAT RL
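
As a rough illustration of how the gene and protein sequences relate, the sketch below strips the 10-character block spacing, computes GC content, and translates codon by codon. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). Only the first row of the gene sequence is embedded, to keep the example short.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence from the record above (60 bp).
first_row = "ATGAGTCGCC GGTTTTATGT TTTTGCTTTA GCAGCTAGCC TATTAGCCCT GGTCGTTGTG"
print(translate(first_row))  # MSRRFYVFALAASLLALVVV
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the listed protein sequence and GC content.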