Gene Noc_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1661 
Symbol 
ID3705641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1854815 
End bp1855840 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content44% 
IMG OID637738138 
Productaminodeoxychorismate lyase 
Protein accessionYP_343663 
Protein GI77165138 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.148869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAAT CTTTTTTCTT TTTATTAGCG CTATCAGGAA TCGCGGTAGG GTTAGGAATA 
GTATGGTTAA AATTTGAATA TGATCGCTTT ACTCATATCC CGCTTCAGAT AGACCAAGAA
GGCTTGAATT TGGTAATTCC TAGTGGTGCC ACAATACATT CCGTTGCTAC CGAACTGTAT
CAACGGGAAG CTTTAGAGCA ACACCCTCTG TATTTAGTAT TACTAGCCCG TTGGCAGGGG
ATAGCTAGGG ACATCAAGGC TGGCGAATAC CATATTCAGG CGGCCACAAC ACCGTCGGCA
TTGCTGCGCC AAATTGTAGC AGGTAAGGTC AAACAATATA GCTTGACTTT AGTGGAAGGA
TGGACTTTTC CACAGGTAAG AAAGGCTATC CAAAACAGTC TTTATCTTCA ACAAACATTG
AATCGGCAAT TACCAGCTTC TGAGATTATG AAACGTCTGG GCTATCCTAA TGAACATCCA
GAGGGTCGGT TTTTTCCTGA TACCTACTTC TTTCCCGCTG GTACTTCCGA CGTGGATTTC
TTACGGCGCG CTTATCAATT TATGGTAAAT CATCTAACCC ATGAATGGGA AAACCGTGAG
CTTGAGCTTC CTTACCGAAG CTCCTACGAT GCTTTGATAC TAGCTTCCAT TATTGAACGG
GAGAGCGCAT TAATCGAAGA ACGGCCTTTG ATTGCTGGTG TGTTCGTGCG ACGTCTTCAA
AGGGGAATGC GTTTGCAAAC CGATCCGACA GTTATCTATG GTCTAGGGAA CCGCTTTGAT
GGAGATTTAC GGCGCCAGGA TTTAAAAAAG GATACGCTTT ATAATACTTA TACACGTTCG
GGACTTCCTC CAACGCCTAT TTGTATGCCT AGTCTAGGAG CATTACGGGC AGCGTTGCAC
CCGGCAGAAG GGAAATCATT ATATTTCGTT TCTCGTGGTG ACGGCAGCCA TCATTTTTCG
GCTACTTTTA AAGAACATAA GGAAGCAGTA CGAAACTATC AATTGGTCAG GAAAAATAAT
CATTGA
 
Protein sequence
MRKSFFFLLA LSGIAVGLGI VWLKFEYDRF THIPLQIDQE GLNLVIPSGA TIHSVATELY 
QREALEQHPL YLVLLARWQG IARDIKAGEY HIQAATTPSA LLRQIVAGKV KQYSLTLVEG
WTFPQVRKAI QNSLYLQQTL NRQLPASEIM KRLGYPNEHP EGRFFPDTYF FPAGTSDVDF
LRRAYQFMVN HLTHEWENRE LELPYRSSYD ALILASIIER ESALIEERPL IAGVFVRRLQ
RGMRLQTDPT VIYGLGNRFD GDLRRQDLKK DTLYNTYTRS GLPPTPICMP SLGALRAALH
PAEGKSLYFV SRGDGSHHFS ATFKEHKEAV RNYQLVRKNN H