Gene Noc_1467 details


Gene Information

Locus tag: Noc_1467
Symbol:
ID: 3705997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1626598
End bp: 1627932
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 52%
IMG OID: 637737955
Product: hypothetical protein
Protein accession: YP_343484
Protein GI: 77164959
COG category:
COG ID:
TIGRFAM ID:
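The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1626598, 1627932)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Both values agree with the Gene Length (1335 bp) and Protein Length (444 aa) listed in the record.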


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACCCA TCCATCATGA TAGAAAAAAG GGCGTAGCGC GCCCTTCACT GGATGATATT 
TTCAATGGCC CGGATGAGTT CGGGCTTCTG AATGTTCAGG CAAAGCGATC CAGCGGTGGA
AACCCGCTGG AAGTGGCCAA ATTTGAAGAG ATCAATACAT TCATGGATCA GCATGGTAGA
GCCCCTGAAA GTACAGGCAA GCTGTCAGAA AAGCTCTTGG CCAGGCGCTT AAAGGGCTAC
TTGGACAACC ATGATCTGCA TTCCACTCTG AAGCCTTATG ATCGTCATGG CTTGCTTCCG
GCGCCAACGT CGTTTGTAGA CGAGGTTGAT GCTTCTGATT TCGAACTGGA CGAACAAACC
CAGCCGGACA GTGAAGTGGA ACAGGTTTCA TCGGATGTGG TGGCAGATGC CGAGGATGTT
ACCTCCCTGG ATGATATTTT TGCCAGTGAG GCATTTGCGG AGATCGATCA GGGGGAGCCT
GCCCTCTTTG ATTCTATCCA TGTCCCGTTC AGTTCGGACC GAGAAGCGCC TGATGAGATA
GCTCAGCGTC GTGTCTGCCA GGATTTCTAT GCGTTTGAGC CGACCTTTCG TGACCTGCAT
GAGAAGCTCA AATCAGGTGA CGCAAAGACA GTACGGTTCC AACAAGCCTC ACAGGTACAG
CCAGGCGATG CTTTCATACT GGAAGGCGTC GTATGCCTGA TCGATGAGGT TGGTGAGTAT
CGAGAAGATA ACCAGGGGCG ATACGACCCT AGGCTGCGCG TGATTTTTGA GAACGGAACG
GAGTCCAACC ATCTTCTGCA ATCATTAGCC AAGCGACTTT ACCAGGATGA GACGGGACGC
CGGATCATCC GTGAGGCCGA CTCAGTGGTT GATGCTTTCA ACAACGTATC CCATAAGGAC
AAGCGGGTCG GGCAGATCTA CTTCGTCACC ACCAGGAGCG AAAACCCTGA TCTGAAGGCG
ATTCCGAACC TGGTTAAGAT TGGTTATACC GAGCTTACGG TCGAAGAGCG GACGAAAAAT
GCTGAGCGGG ACACTGCTTT CCTGGAAGCA CCGGTCAAGA TCCTTGCTTC GATGGAGTGC
TACAACCTGA ACCCCAACAA GTTCGAAAGC CTGATTCATG GGTTTCTGTA TGCACAGCGG
TTGAAGGTGG CGTTGATCGG GAAAGACGGG AAAGCCTATC ACCCGAAAGA GTGGTTTTCC
GTTCCTTTGG ATACAGCAAG GGAAGTCGTT AAACGGATCA TTGATGGCAG CATCGTCCAT
TACCGGATGG ACAACACCAC TGGACGGTTG GTGAAGAAAA GGATTGGTTC GATCTGCATC
CCGAACCGCG CTTAA
 
Protein sequence
MPPIHHDRKK GVARPSLDDI FNGPDEFGLL NVQAKRSSGG NPLEVAKFEE INTFMDQHGR 
APESTGKLSE KLLARRLKGY LDNHDLHSTL KPYDRHGLLP APTSFVDEVD ASDFELDEQT
QPDSEVEQVS SDVVADAEDV TSLDDIFASE AFAEIDQGEP ALFDSIHVPF SSDREAPDEI
AQRRVCQDFY AFEPTFRDLH EKLKSGDAKT VRFQQASQVQ PGDAFILEGV VCLIDEVGEY
REDNQGRYDP RLRVIFENGT ESNHLLQSLA KRLYQDETGR RIIREADSVV DAFNNVSHKD
KRVGQIYFVT TRSENPDLKA IPNLVKIGYT ELTVEERTKN AERDTAFLEA PVKILASMEC
YNLNPNKFES LIHGFLYAQR LKVALIGKDG KAYHPKEWFS VPLDTAREVV KRIIDGSIVH
YRMDNTTGRL VKKRIGSICI PNRA
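The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC percentage calculation; the helper names are hypothetical, and the example runs only on the first line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGCCACCCA TCCATCATGA TAGAAAAAAG GGCGTAGCGC GCCCTTCACT GGATGATATT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the whole gene sequence, the cleaned length can be compared with the Gene Length field and the GC percentage with the GC content field.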