Gene Noc_1051 details


Gene Information

Locus tag: Noc_1051
Symbol:
ID: 3707234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1158218
End bp: 1159558
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 53%
IMG OID: 637737556
Product: Acetyl-CoA carboxylase, biotin carboxylase
Protein accession: YP_343089
Protein GI: 77164564
COG category: [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
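
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon, neither of which is stated in this record. The function name check_lengths is hypothetical.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length (assumptions, not facts
    taken from the record).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = gene_length_bp // 3          # total codons, including the stop
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0       # a CDS should be a whole number of codons
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields above.
print(check_lengths(1158218, 1159558, 1341, 446))  # prints True
```

Under these assumptions the record is self-consistent: 1159558 - 1158218 + 1 = 1341 bp, and 1341 / 3 = 447 codons, which is 446 residues plus one stop codon.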


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00003463
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGATA AGATTGTTAT CGCCAATCGG GGCGAAATCG CCTTGCGTAT CCTCCGGGCT 
TGCTGGGAAT TAGGGCTCAA AACCGTAGCC ATCCACTCTG AAGTGGATCG CGAACTCAAA
CATGTTCTAC TAGCGGATGA GACGGTTTGT ATCGGCCCTG CGGCATCTTC TCAAAGCTAC
TTGAATATTC CCGCCGTGAT CAGCGCCGCT GAGATTACCG ATGCCGTCGC CATTCACCCA
GGTTACGGTT TTTTGTCCGA AAACGCCGAC TTTGCTGAAC GCGTTGAACA AAGCGGCTTC
GTCTTTATCG GGCCGCGGCC TGAGACTATC CGTCTGATAG GCGATAAAGT CTCCGCTATT
AAGGCCATGA AGTCCTCCGG CGTGCCATGC GTACCCGGCT CCGAAGGGCC CCTCGGAGAA
GATGATGAGG AAAATATAGC CATTGCCAAG GAAATCGGCT ATCCGGTCAT GATTAAGGCT
TCAGGGGGAG GGGGAGGCCG AGGAATGCGC GTTGTTCATT CTGAGGCGCA TTTGCCCACC
GCTATTTCCC TCACCCGGAG CGAAGCCAGC GCCGCCTTTG GCAATGACAT GGTTTACATG
GAAAAATATC TGGAAAATCC TCGTCATGTG GAATTCCAAG TTCTGGCCGA CACCCACGGT
CAAGCCATCT ACCTCGGCGA GCGGGACTGT TCCATGCAGC GCCGTCACCA GAAAGTTGTT
GAAGAGGCAC CTGCCCCAGG CATTACCAAT GAACAACGGC AACGCATGGG AGAAATCTGC
ACTGAAGCCT GCCGCAAGAT GGGTTACCGA GGAGCAGGTA CGTTTGAATT TCTCTATCAA
GATGGCGAAT TTTATTTCAT TGAAATGAAT ACCCGAGTCC AGGTGGAACA CCCTGTAACT
GAAATGATCA CCGGGATAGA CATTGTCAAG GAGCAACTCC GTATTGCTGC CGGAGAGAAG
CTCAGTTATC GCCAGGAAGA TATCATGATC CACGGGCACG CCATCGAGTG CCGCATCAAC
GCCGAGGATC CCACTAATTT CATGCCCAGC CCAGGAACGG TGACAAGATA TCATACGCCT
GGTGGCCCGG GCGTCCGAAT AGATTCCCAC CTATACGCTG GTTATACTGT TCCCCCTCAC
TACGATTCTT TGATCGGCAA ACTCATTACC CATGGGGAAA CCCGGGAAGC AGCCATTGCG
CGCATGCAAA TTGCACTCAC TGAACTGGTC ATCGATGGCA TTAAGTGTAA TGCGCCACTC
CATCAAAAAA TCCTCGACAA CACGCACTTC CGGGCTGGCG GCGCTAATAT CCACTACCTA
GAGCGAATGC TAGGATTATA G
 
Protein sequence
MLDKIVIANR GEIALRILRA CWELGLKTVA IHSEVDRELK HVLLADETVC IGPAASSQSY 
LNIPAVISAA EITDAVAIHP GYGFLSENAD FAERVEQSGF VFIGPRPETI RLIGDKVSAI
KAMKSSGVPC VPGSEGPLGE DDEENIAIAK EIGYPVMIKA SGGGGGRGMR VVHSEAHLPT
AISLTRSEAS AAFGNDMVYM EKYLENPRHV EFQVLADTHG QAIYLGERDC SMQRRHQKVV
EEAPAPGITN EQRQRMGEIC TEACRKMGYR GAGTFEFLYQ DGEFYFIEMN TRVQVEHPVT
EMITGIDIVK EQLRIAAGEK LSYRQEDIMI HGHAIECRIN AEDPTNFMPS PGTVTRYHTP
GGPGVRIDSH LYAGYTVPPH YDSLIGKLIT HGETREAAIA RMQIALTELV IDGIKCNAPL
HQKILDNTHF RAGGANIHYL ERMLGL
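
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch, not the method used by the source database: it strips the block spacing, computes a GC fraction, and translates DNA with a codon table built from the standard compact amino-acid string (translation table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons). The helper names clean, gc_fraction and translate are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11 (the tables differ in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGCTGGATA AGATTGTTAT CGCCAATCGG GGCGAAATCG CCTTGCGTAT CCTCCGGGCT"
dna = clean(first_line)
print(len(dna))        # prints 60
print(translate(dna))  # prints MLDKIVIANRGEIALRILRA
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through clean and gc_fraction gives a value that can be compared with the GC content field of the record.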