Gene Noc_2777 details


Gene Information

Locus tag: Noc_2777
Symbol:
ID: 3705507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3153557
End bp: 3154639
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 52%
IMG OID: 637739253
Product: histidinol-phosphate aminotransferase
Protein accession: YP_344754
Protein GI: 77166229
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
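
The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, although they reproduce the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3153557, 3154639)
print(length_bp, protein_length(length_bp))  # 1083 360
```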


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAAG ACCGGGTTGC CCAATGGATT CGTCCCGAGA TACAGCGACT CTCTGCCTAT 
CGGGTCGCCG ATGCAGCGGA TTTAATCAAA TTGGATGCCA TGGAAAATCC CTATACTTGG
TCGCCGGAAT TAATAGAGGC TTGGCTGGAG CGGTTGCGGC AAGTCAGCGT TAACCGTTAT
CCAGACCCAC AGGCTCGCAG CCTCAAGCTT CGTCTCCGGC AGTATCTGGC CCTGCCGGAA
GACATGGAGA TGATTTTGGG GAATGGTTCC GATGAACTGA TCCAGATGGT GTTACTGGCC
GTGGCGGGGC CAGGGCGATC TGTGGTTGCG CCCGAGCCCA CTTTTGTCAT GTACCGGCAG
ATTGCTGCTC TACTGGGGCT GCAATATCAG GGGGTAGCTC TGCGGGAGGA TTTTTCTTTA
GACTTACCGG CAATGCTACA GGTTATTCGG GAGCGGGTGC CAGCAGTTGT CTTTATCGCT
TATCCCAATA ATCCCACTGG TAATCTCTTT TCCGCTGAAG AATTGCAAGC CATTATTGAA
GCTTCTCCTG GGCTTGTCAT CGTGGATGAA GCCTATAGCG TGTTTGCAGG TGAAACCTTC
ATGCCCCGGT TGGAGGACTA CGATCATCTC CTGGTCATGC GAACGCTCTC TAAGATTGGC
CTGGCAGGTC TCAGGTTAGG GATGTTGATG GGAAATCCAG CTTGGATCAA GGAGCTAGAG
AAAGTACGGT TACCCTATAA TATTAACCAA TTAACCCAAG TCAGTGCCGA GTTTGCTTTG
GAGCAGCCGG GGGGGTTAGA TGAACAGGCC CGGCTCATCT GCAAGGCCCG GGCGCAGCTG
CAGAGGGCTT TGCAACAGTT ACCGGGGATT CAAGTTTATC CTAGCGATGC AAACTTTATT
CTTTTCCGTA CTCCCCCCCA TCAGGCTGAG GCGATTTTTA CTGCCATTAA GGAACGGGGG
GTCTTAATTA AGAACCTTTC CGGCCAGGGT GGCCTGTTAA CGGATTGCCT CCGGGTGACC
GTAGGCACGG CAGATGAAAA TCACGCCTTT TTGAAAGCGC TAAAAGCTGG GCGAAAAAAC
TGA
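
The GC content reported for this gene (52%) is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown here, with spaces and line breaks between blocks of ten bases. The helper name `gc_percent` is hypothetical, and the example runs on the first row only, so its result need not equal the whole-gene figure.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 60 bases of the gene sequence above; paste the full sequence
# to compare against the reported whole-gene value.
first_row = "ATGACAAAAG ACCGGGTTGC CCAATGGATT CGTCCCGAGA TACAGCGACT CTCTGCCTAT"
print(round(gc_percent(first_row), 1))
```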
 
Protein sequence
MTKDRVAQWI RPEIQRLSAY RVADAADLIK LDAMENPYTW SPELIEAWLE RLRQVSVNRY 
PDPQARSLKL RLRQYLALPE DMEMILGNGS DELIQMVLLA VAGPGRSVVA PEPTFVMYRQ
IAALLGLQYQ GVALREDFSL DLPAMLQVIR ERVPAVVFIA YPNNPTGNLF SAEELQAIIE
ASPGLVIVDE AYSVFAGETF MPRLEDYDHL LVMRTLSKIG LAGLRLGMLM GNPAWIKELE
KVRLPYNINQ LTQVSAEFAL EQPGGLDEQA RLICKARAQL QRALQQLPGI QVYPSDANFI
LFRTPPHQAE AIFTAIKERG VLIKNLSGQG GLLTDCLRVT VGTADENHAF LKALKAGRKN
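
The protein sequence is the conceptual translation of the gene sequence under translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may serve as start codons, so the sketch below simply builds the standard table. The function name `translate` is illustrative, and the sketch assumes an unambiguous A/C/G/T sequence already in frame; any other symbol raises a `KeyError`.

```python
from itertools import product

# Standard codon table in NCBI order: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon, if any
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 bases of the gene sequence above.
print(translate("ATGACAAAAG ACCGGGTTGC CCAATGGATT"))  # MTKDRVAQWI
print(translate("CGAAAAAACTGA"))                      # RKN*
```

The two printed fragments match the first ten and last three residues of the protein sequence, with the trailing `*` standing for the TGA stop codon.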