Gene Noc_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1501 
Symbol 
ID3705992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1664860 
End bp1666053 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content55% 
IMG OID637737988 
Producttransposase 
Protein accessionYP_343517 
Protein GI77164992 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.915709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAAAG CCACAAAGAT ACGCCTCTAC CCTACGAGGG AGCAGGCGGA GTTTCTGAAC 
CGCCAGTTCG GTAGCGTGCG GTTTTGCTAT AACACCGGCC TTCGCATTAT GTCTCACCGC
TATAAGCGAC ACGGCCAATC CTTGAGCGCC AAGTATGACA TCAAGAAATT GTTGCCTGTG
GCCAAGAAGT CCCGTAAGTA TGGATGGTTG AAGGAGGCGG ATTCGGTTGC GCTCGCTCAA
GCGTGTATCA ATCTCGACAA GGCGTTTCAG CGTTTTTTCA AGGAAAAGAA AGGCTATCCC
CGGTTCAAGC GCAAGCGTGG CAAGCAGTCG AGCTACCATT GCATGAGCGT GAGCTGCGGT
GAAAGCTGGG TCAAGGTGCC GAAGCTCGGC CCCATCAAGG CTAGAGTTCA TCGGTCCGTT
GAGGGCAAAT TGAAAAGCAT CACCCTGTCG CGCACCGTTA CCGGCAAACA CTACGCGTCT
CTCCTGTATG AAACCGAGCA ACCTGTCCCT GAGCCGATGA CGGCTATCGA CGCCACCAAG
GTGCTCGGCT TGGACATGGG TCTCTCGCAC CTGGCGATTG ACTCCACTGG ACGAAAAGTA
GCCAATCCAC GCTTTATCAA GCAGGTTCAG AAAAATCTCA AGCGCAAGCA ACAGTCGCTG
TCACGAAAGC AAAAAGGCAG TTCCAAACGA GCAAAAGCAC GCCTACTGGT CGCTAAGGCT
CACGAGCGGG TCGCTGATGC CCGCAGTGAC TTCCAGCATA AACTGTCCCG GCAGATCGTT
GACGACAACC AAGCGGTGAT CGTCGAGACG CTGAAAGTCA ACAACATGAT GAAGAACGCC
AAGCTCGCCA AGCATATCGG CGATGCCTCT TGGCACGCCT TGATCGCCAA GCTGGCATAC
AAGGCCAAGG AGCAGGGCAA ACATCTGGTC AAGATCGACC CCTGGTTTGC CAGCTCAAAA
ACTTGCCATG TATGCCAGCA CAAGATGGAT GCCATGCCAC TAAATATCCG GTCGTGGGCG
TGCCCTACCT GCCACACGCG CCATGACCGC GACATCAACG CGGCGCTGAA CATCCAACAT
CAAGGCATTT TGAAGTTGAA GGCGGAAGGG CTGTCCGTCT CTGCCCATCG AGGCTTGCGT
AAATCCGGCA TGCCGCCGGT GGCTGCCGTT GAAGTGGGAA GCTCCGTCCG ATAG
 
Protein sequence
MLKATKIRLY PTREQAEFLN RQFGSVRFCY NTGLRIMSHR YKRHGQSLSA KYDIKKLLPV 
AKKSRKYGWL KEADSVALAQ ACINLDKAFQ RFFKEKKGYP RFKRKRGKQS SYHCMSVSCG
ESWVKVPKLG PIKARVHRSV EGKLKSITLS RTVTGKHYAS LLYETEQPVP EPMTAIDATK
VLGLDMGLSH LAIDSTGRKV ANPRFIKQVQ KNLKRKQQSL SRKQKGSSKR AKARLLVAKA
HERVADARSD FQHKLSRQIV DDNQAVIVET LKVNNMMKNA KLAKHIGDAS WHALIAKLAY
KAKEQGKHLV KIDPWFASSK TCHVCQHKMD AMPLNIRSWA CPTCHTRHDR DINAALNIQH
QGILKLKAEG LSVSAHRGLR KSGMPPVAAV EVGSSVR