Gene Noc_0018 details

Gene Information

Locus tag: Noc_0018
Symbol:
ID: 3705951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 11319
End bp: 12410
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 42%
IMG OID: 637736542
Product: RecF protein
Protein accession: YP_342090
Protein GI: 77163565
COG category: [L] Replication, recombination and repair
COG ID: [COG1195] Recombinational DNA repair ATPase (RecF pathway)
TIGRFAM ID: [TIGR00611] recF protein
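
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 11319
end_bp = 12410
protein_length_aa = 363

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 1092 0 363
```

Under these assumptions the coordinates, the 1092 bp gene length, and the 363 aa protein length agree with one another.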


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCATA TTACTCATCT TGACATACGC AACTTCAGAA ATCTAAAGCA TATTGAATTG 
CATCCTAGCA AAGGGGTCAA TATCCTTTCG GGTGCCAATA GCAGTGGAAA AACCAGCTTT
CTAGAAGCCA TTTATTTGCT TGGCTTAGGA CGTTCTTTCC GCACCGTTCA ATTAATTTCG
GCTATTCAGG CCGGCATGGA ATCGCTCCGT GTTGTAGCCA AAGTGAAGCA GGTGGGCGGT
TCCCATACCG CAGGAGTAGA GTTTGGTCCT GCTGGTTTTC GAGCACGTAT CAACAAAGAT
ACGGTAAAGA AACGTTCCCA ATTAGCCACC CAATTACCCT TGTTATATAT GTCTTCTTAT
AGTCATGTTG TACTCGATGG AGGACCTCGT TACCGCAGGC AATGGCTCGA CTGGAGTTTA
TTTCATCTGG AGCCCGGATT TCACGATCTA TGGTGGTGTT ATCAACGAAC GCTTAAGCAG
CGTAATCATG TATTAAGAGT TCATAAGCCT AGCTGGCAAC AGGAAATTAA TGCTTGGAAT
AAGAAGCTCT CTACCTATGG GGAGCAAATT ACTTCGCTGC GAGAAGCTAT TCTCTTTAAA
CTACAGGATA GCGTATCACA GTTATTTACG GCCTTGGCCC ACCAACCAAT TTCCCCTGTT
ACTATGGAAT TTAAGCAAGG TTGGGCTCGC ACGGTGAGGC TAGAAGAAAT TCTAAATGAA
TCGTTGAACT ATGATCGAGC AGCGGGTTAT ACACGATATG GACCTCACCG TGCAGAAGTG
GCATTTTATG TGGATGGAAA AGATGTTAGG GAAATTTTAT CTAGAGGTCA GCAAAAGGTA
TTTTGCTATT CTCTTGCGTT GAGTCAGGCA AATCTATTAT ACAGAACTAA AGAACAAAAT
TGTATTTTCT TAATTGATGA TTTTACTTCG GAACTCGATG CTGATCATCG AAAGCGGCTT
TTAACATTGT TAAATAAGCT AGGCATGCAG GTTTTTGCCA CTACTATAGA ATCATTAGGT
AGTGAAATAA AGGCACATCC TAATATCAAG GAGTTCCACG TGAAACTGGG GCAGGTAGAA
GAAATGGTAT AA
 
Protein sequence
MMHITHLDIR NFRNLKHIEL HPSKGVNILS GANSSGKTSF LEAIYLLGLG RSFRTVQLIS 
AIQAGMESLR VVAKVKQVGG SHTAGVEFGP AGFRARINKD TVKKRSQLAT QLPLLYMSSY
SHVVLDGGPR YRRQWLDWSL FHLEPGFHDL WWCYQRTLKQ RNHVLRVHKP SWQQEINAWN
KKLSTYGEQI TSLREAILFK LQDSVSQLFT ALAHQPISPV TMEFKQGWAR TVRLEEILNE
SLNYDRAAGY TRYGPHRAEV AFYVDGKDVR EILSRGQQKV FCYSLALSQA NLLYRTKEQN
CIFLIDDFTS ELDADHRKRL LTLLNKLGMQ VFATTIESLG SEIKAHPNIK EFHVKLGQVE
EMV
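
The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation routine. This is a sketch only: `translate` and `CODON_TABLE` are hypothetical names, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). It is applied here to the first and last blocks of the gene sequence only, not to the full record.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering, with the
# third base varying fastest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
first_block = translate("ATGATGCATA TTACTCATCT TGACATACGC")
last_block = translate("GAAATGGTAT AA")

print(first_block)  # MMHITHLDIR
print(last_block)   # EMV
```

The two results match the first ten residues and the last three residues of the protein sequence listed above.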