Gene Noc_A0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_A0029 
Symbol 
ID3704297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007483 
Strand
Start bp26762 
End bp28843 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content56% 
IMG OID637736524 
Producthelicase-like 
Protein accessionYP_342072 
Protein GI77163546 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAC TGATCAAGGT TGAATACGGT CGCACCGGCA AGAGTACCAA TAGCAATACC 
TTGGGTATGC GGGAGATGCA GGCCCGAGCA TACGAAAAAC GCGACAGTCA GTACTTGTTG
CTCAAGGCAC CACCGGCCTC CGGCAAATCG CGAGCACTGA TGTTTCTGGC CCTCCATAAG
CTCCATTGCA CTGGCCTGCG CAAAGCCATC GTTGCTGTGC CGGAGATATC GATTGGGGCA
TCCTTTAAGA GCACCGACCT TACTACTCAC GGCTTCGAGT GGGACTGGCA CGTGGAAGAC
CGCAACGACC TATGCCGCGC CGGCAATGAC AGCGGCAAGG TGGACGCTTT CTGTCGCTTC
ATGGCGAGCC CGACCGATCG TATCCTGGTC TGCACCCACG CCACGCTGCG ATTCGCGTTT
GCCAAGCTCG GTCCTAGCGA CTTTGATGAC TGCCTGGTGG CCATCGACGA ATTTCACCAT
GTCTCGGTCG ATGAAGGCAA CCGTCTGGGT GGATTGATCG ACGATCTGAT GCTGGGCGCC
ACGGCGCATA TCGTGGCGAT GACCGGCAGT TACTTCCGCG GTGACGCCGC ACCGATCCTG
CTGCCGGACG ACGAAGCCCG TTTCAACAAA GTGACCTACA CTTACTACGA ACAGCTCAAC
GGCTACCGCT ATCTGAAGTC ACTGGGCATT GGCTACCATT TCTACCCAGG GCGCTACCTC
GATGCGATTA ATGAGGTGCT TGATCCTGAT AAGAAAACCA TCGTCCATAT TCCCAACGTC
AACAGCGGCG AAAGCACCAA AGACAAGTAT AGTGAGGTCG ATCATATTCT CGATGCGCTG
GGTGAGGTGG AGGCCCAGGA CAGCCAGACC GGTATCGTAA CAGTGCGCCA TCCCTCGGGA
CGCTTGCTCA AGGTTGCTGA CCTGGTAACG GATACGCCTA TCCGCCGCGA TGTGCAGGAG
TATCTGCGCG GCGTAGAGCA CCCCGAAGAC GTGGACGTGA TCATTGCCTT GGGTATGGCC
AAGGAAGGCT TTGACTGGCC GTTCTGTGAG CATGTACTGA CCATTGGCTA TCGCCGATCG
ATGACCGAGA TTGTCCAGAT CATCGGGCGT GCCACGCGTG ACAGCGAGGG CAAGACTCAT
GCCCAGTTCA CTAACTTGAT CGCTCAGCCG GACGCAGACG ATGAGGACGT GAAGGTCAGC
GTCAACAACC TACTCAAGGC GATCACGGTG TCACTGCTGA TGGAACAAGT GCTGGCACCT
AGCGTGTCCT TTAAGCCGCG TTCGCGGATG CAGGCGGGTG AAGAGACGGA GCCAGGCACG
ATAATCATTG AGGATACTCA GGCGCCAGTC TCGGACAAGG TGATCAGCAT CCTCCAGTCA
GGCGGCCAGG AAGACATTAT TGCCGCACTC CTGCAAAAGT CGGAAGTGGC TGCTGCGGCA
GTGACGGAGG TTGTCGAGCC TGAGGTAATC AATGAGTTCG AACTACCTAA GGTTATCGAG
CGTCTGCACC CTGAGCTGGA TGCTAACGAG CGCGAGCTAC TCCGCGAGAC AGTGCTAACC
ACCATGGGTG CCCGCGCCAC AGGCGGGTTG TTTAACGAAG AAGACTTGCC CGAAGGCGCT
GAGATACATG AGCCTGGCCA AATCTATGAC ACACATGGCG ATGGGCAGCC GGCCGAGCTC
GGTACTGCGT CCGATGGACG TGACCTCAGT GATGACAGCA GCGAGGGTGG GGCATCGGGC
AGCCGTCAGT TCCTCAAAAT TGGCGATAAG TTCGTTAACA TCGAACACCT CAATATTGAC
TTGATCAATC AGATCAACCC TTTCCAAGGG GCCTATGAAA TTCTCTCCAA GGCGCTGACG
GCACCGGTGC TCAAGGCGAT TCAGGACACC GTGGTAGGCG ACCGGGCGCA GATGACCGAG
GAGGAAGCGG TGATATTGTG GCCCAGAATT GTAGAGTTCA GGAAAGCTAA GGGACGAGCA
CCGATGGTGC ATAGTGATGA TCACATGGAG GTCCGCTACG CTCTGGCACT GGCCTATATC
CAGCATAAAA AGCGTGAGCG TATGCAGGAG GCCACAACAT GA
 
Protein sequence
MTELIKVEYG RTGKSTNSNT LGMREMQARA YEKRDSQYLL LKAPPASGKS RALMFLALHK 
LHCTGLRKAI VAVPEISIGA SFKSTDLTTH GFEWDWHVED RNDLCRAGND SGKVDAFCRF
MASPTDRILV CTHATLRFAF AKLGPSDFDD CLVAIDEFHH VSVDEGNRLG GLIDDLMLGA
TAHIVAMTGS YFRGDAAPIL LPDDEARFNK VTYTYYEQLN GYRYLKSLGI GYHFYPGRYL
DAINEVLDPD KKTIVHIPNV NSGESTKDKY SEVDHILDAL GEVEAQDSQT GIVTVRHPSG
RLLKVADLVT DTPIRRDVQE YLRGVEHPED VDVIIALGMA KEGFDWPFCE HVLTIGYRRS
MTEIVQIIGR ATRDSEGKTH AQFTNLIAQP DADDEDVKVS VNNLLKAITV SLLMEQVLAP
SVSFKPRSRM QAGEETEPGT IIIEDTQAPV SDKVISILQS GGQEDIIAAL LQKSEVAAAA
VTEVVEPEVI NEFELPKVIE RLHPELDANE RELLRETVLT TMGARATGGL FNEEDLPEGA
EIHEPGQIYD THGDGQPAEL GTASDGRDLS DDSSEGGASG SRQFLKIGDK FVNIEHLNID
LINQINPFQG AYEILSKALT APVLKAIQDT VVGDRAQMTE EEAVILWPRI VEFRKAKGRA
PMVHSDDHME VRYALALAYI QHKKRERMQE ATT