Gene Noc_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0053 
Symbol 
ID3705929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp50043 
End bp51848 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content57% 
IMG OID637736578 
ProductTOPRIM domain-containing protein 
Protein accessionYP_342125 
Protein GI77163600 
COG category[L] Replication, recombination and repair 
COG ID[COG0305] Replicative DNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00140147 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAAA CATTTTCCGA TTTTGGGATT GATGTGCCGC CCGCCGCTTC CGGGCAACTC 
AGTCTCACCT GCCCCCAGTG CTCCGCTCAA CGCAAGAAAA AACGTGCCAA GTGCTTAAGC
GTCAATGTCG AGAAAGGGGC GTGGATATGC CACCATTGTA GCTGGCGAGG GGGGCTTTCC
CAGCGGGAGC AGTCCAATCG CACTCTGTAC TGGCGACGCC CCGATTACCG ACAGCCGGCG
CCCTTTTCCC CGGGAGCCCT GCCTGAAGAT ATCCAGCGCT GGTTTGCCAA ACGGGGGATT
ACCCCCGCGG TCCTTGAGCG CAATCATATC GCGACCAAAA AGGTGTATAT GCCCCAGCTG
GAGAGGTGGG TGAGCGCGAT CGCCTTTCCT TACTACCGGG GCGAGACGCT CATCAACGCC
AAGTACCGGG ATGGGAGGAA ACACTTCCGC CTGGAGGCCG GGGCCGAGCG CATTTTGTAT
GGACTTAACG ACTTGGAGCA GACTACCCTC ATTGTGGAAG GGGAGATGGA TAAACTGGCG
TTAGAGGTAG CCGGTTTTCG CAATGTGGTC AGCGTCCCCG ATGGCGCCCC CCCACCCCAG
GCTAAGGATT ATGCCCGCAA ATTTGAGTTT CTGCAGGCGG ATGAAGAAGC GCTTAAGACG
GTCAAGACGT GGGTAATCGC AGTCGACAAT GACGCACCCG GGCAGTATTT GGCCGAGGAA
CTCTCCCGTC GCTTCGGGCG AGAAAAATGT AAGCGGGTCC TCTGGCCTGA AGCGTGCAAG
GATGCCAATG AGGTGTTGCT GAAGCGGGGA CCTGAGGTGC TCACCGATTG CATCAAAAAT
GCCCAGCCTT ATCCTCTCGC CGGGGTGTTG ACGGTCAGCC ACCTCAGCGA AGACATCGAT
TTTCTCTATA CCCATGGGCT CAAGCGGGGG ATGTCAACCG GCTGGCCCTC TGTGGATATA
TGTTACACCG TCAAGCCTGG GGAGTTGACG GTGGTGACGG GTGTCCCCAA CAGTGGGAAA
TCCAATTGGC TGGACTGCTT AGCCCTGAAT CTTGCCCAGC AGGGCTGGCG CTTTGGTGTC
TTCAGCCCCG AGAACCAGCC AGTGGGCCAC CATATGGCGC GGATGATAGA AAAGTGGGCC
GGTAAGCCGT TCAATAAAGG GTCTATTGCC CGACTGAGCC GGTCCACGCT GGCGCAGGGA
AAGGACTGGG TGCATGAGCA CTTTTATTGG ATTCTCCCAG AGGATGACCA GGATTGGACC
GTCGAGCACG TTCTGGACCG TGCCAGGGCG CTGGTGCTGC GGTATGGGAT TAAGGGGCTC
CTGCTCGACC CCTGGAATGA GTTTGAGCAT CTGCGTGCGC CCAATGTCAC GGAGACGGAG
TATATCTCGT TGGTTTTAAA GCGGGTGCGG CAATTCGCCC GTTATTACCA GGTCCATGTG
TGGATAGTGG CCCATCCGGC AAAGCTCTTC CGGGGCAAGA ACGATCAATA TCCCGTGCCC
ACGCTCTATG ACATCTCAGG CTCGGCTAAC TGGCGCAATA AGGCCGATAA TGGCCTCGTG
ATTTGGCGCG ATCTTGGCGA CCCTAAAAAA GATTTGGTGG AGATTCATAT CCAAAAGATT
CGCTTTCGGG AGGTGGGAAG ATTGGGCGCA GTGCGGCTGC GTTTTGACCC TGTGACGGCA
GTGTACCGGG AGCCTGAACC CGATGATGAA GCGGCCTTCC CCCCTGCGGA TGGAGCTGAT
AAGGCGGATG AGCAAGCGTA TCTGGACAGT CTGTACGCCG AGTATGAGGC CCAGGGCGGA
AAATAG
 
Protein sequence
MDKTFSDFGI DVPPAASGQL SLTCPQCSAQ RKKKRAKCLS VNVEKGAWIC HHCSWRGGLS 
QREQSNRTLY WRRPDYRQPA PFSPGALPED IQRWFAKRGI TPAVLERNHI ATKKVYMPQL
ERWVSAIAFP YYRGETLINA KYRDGRKHFR LEAGAERILY GLNDLEQTTL IVEGEMDKLA
LEVAGFRNVV SVPDGAPPPQ AKDYARKFEF LQADEEALKT VKTWVIAVDN DAPGQYLAEE
LSRRFGREKC KRVLWPEACK DANEVLLKRG PEVLTDCIKN AQPYPLAGVL TVSHLSEDID
FLYTHGLKRG MSTGWPSVDI CYTVKPGELT VVTGVPNSGK SNWLDCLALN LAQQGWRFGV
FSPENQPVGH HMARMIEKWA GKPFNKGSIA RLSRSTLAQG KDWVHEHFYW ILPEDDQDWT
VEHVLDRARA LVLRYGIKGL LLDPWNEFEH LRAPNVTETE YISLVLKRVR QFARYYQVHV
WIVAHPAKLF RGKNDQYPVP TLYDISGSAN WRNKADNGLV IWRDLGDPKK DLVEIHIQKI
RFREVGRLGA VRLRFDPVTA VYREPEPDDE AAFPPADGAD KADEQAYLDS LYAEYEAQGG
K