Gene Noc_1428 details


Gene Information

Locus tag: Noc_1428
Symbol:
ID: 3706036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1582897
End bp: 1584540
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 49%
IMG OID: 637737918
Product: hypothetical protein
Protein accession: YP_343447
Protein GI: 77164922
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
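
The coordinates and lengths listed above are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1582897
END_BP = 1584540


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1644 547
```

Under those assumptions the result matches the listed gene length of 1644 bp and protein length of 547 aa.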


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCGA GGCGTCAGCT TAGATTTTGG TTGATTGGGT TTTTGCTCTT TCTCATCTCG 
GTTTACCTAC TGCGGGAAAT TTTGTTGCCA TTTGTGGCGG GCATGGTGGT AGCCTATTTG
ATTGATCCCC TCTGTGATTG GCTTGAGCGG AAGGGATGCT CGCGCACGGC GGCGACGAGC
CTAGTGACTG CAGGATTCAT CCTGGTCGTT AGCATGGTCT TGCTGTTACT TGTTCCGCTA
TTGCGGAGTG AAATTGTGCA TTTGATAGAA ACACTGCCTT CCTTGATCGC TCGGGCTCAG
GATTCAACCT GGCCCTGGTT GCAGTTACTT CAAGAGCGTT GGTCCATTGA TATGTCCCAG
ATCCAGAATG CCGCCAAGGA TCAGGCAGGC ATATTGATTA AGTGGATAGG CAAGACCGTG
GGGACGATTC TGAGTAGCGG TTTAGCATTA GCTAATCTTT TGTCGCTCGT TTTTATTATG
CCAGTAGTAG CGTTCTATCT GCTACGGGAT TGGGACAAAC TGATAGCCCA GATCGATAGT
CTGCTGCCGC GAAAACACGC CCCTGTCATC CGGGAACAAG TCAAGCTAAT CGATACGGTT
TTATCGGGTT TTATACGTGG TCAAGTCAGC GTATGTTTGC TGCTTGGCAC TTTCTATGCT
GTGGGGTTGG CGTTGATTGG GTTGGACTTC GGATTGATGG TGGGCATGCT TGCAGGGTTG
CTCTCATTTA TTCCTTATGT GGGCACTATC GTAGGTTTTA TTGCAGGGAT TGGATTGGCC
TTTGTGCAGT TCTCCGAGTG GACCCCCATC TTTCTGGTGG CGGGAGTTTT CGTGATAGGG
CAGGTTGTCG AAGGTAATGT ACTAACGCCC CGTCTGGTAG GGAATCGTGT GGGATTGCAT
CCGGTCATGG TCATTTTTGC CCTTCTAGCT GGGGGGGGAT TGTTTGGCTT TTTGGGTATC
TTGCTTGCTG TTCCGGTAGC TGCGGTGGTG GGGGTGTTAA CCCGGTTTGC AATTAAACAG
TATGTGACGA GCCGTTATTA TTTGGATCTT AGCCCAGCGG CAGATTCACC GACCCAATTA
ACCCATGATC ATATTTCGGA AGGGAAGACA TTAACAGAGG GCGGGGATCG ACGAGAACAT
AAGGTATTAC AACAAAGTCC CCCGATAGGG GTAACCCAAT ATTCGTTGGT TACAACCTGG
CGTATAGAGG CACCGATCGC CGCTGTCTGG AATGCTATTT CCAATGCCGA GAGCTGGCCA
ACGTGGTGGA ATTATGTGGA ACGGGTAGTT AAATTGAAGA CAGGCGATAA AAATGGTTTG
GGTTCCCGTT ATGGTCTCCT ATGGCGAACT CGTTTGCCCT ATAAGATTTC ACTCGAGTCC
AAAGTGACCC GTATTGAGGC CCCTGTTTTT GCTGAAGTAA TAGTGAGTGG TGATGTGGAA
GGATGGGGGC GATGGCGTCT TGCTAGCAAG GGGAGCATCA CGGAAGTGCG CTATGACTGG
CATGTGCGTG TCACCAAGTT TTGGATGAAT TGGCTAACTC CGTTGCTTAA GCCTGTTTTT
AAATGGAACC ATAGTGTGGT TATGAAGCAA GGTGGTAAGG GGTTAGCCCG TTATCTTGAT
GCTCGTTTTA TTGGCATGGA GTGA
 
Protein sequence
MSARRQLRFW LIGFLLFLIS VYLLREILLP FVAGMVVAYL IDPLCDWLER KGCSRTAATS 
LVTAGFILVV SMVLLLLVPL LRSEIVHLIE TLPSLIARAQ DSTWPWLQLL QERWSIDMSQ
IQNAAKDQAG ILIKWIGKTV GTILSSGLAL ANLLSLVFIM PVVAFYLLRD WDKLIAQIDS
LLPRKHAPVI REQVKLIDTV LSGFIRGQVS VCLLLGTFYA VGLALIGLDF GLMVGMLAGL
LSFIPYVGTI VGFIAGIGLA FVQFSEWTPI FLVAGVFVIG QVVEGNVLTP RLVGNRVGLH
PVMVIFALLA GGGLFGFLGI LLAVPVAAVV GVLTRFAIKQ YVTSRYYLDL SPAADSPTQL
THDHISEGKT LTEGGDRREH KVLQQSPPIG VTQYSLVTTW RIEAPIAAVW NAISNAESWP
TWWNYVERVV KLKTGDKNGL GSRYGLLWRT RLPYKISLES KVTRIEAPVF AEVIVSGDVE
GWGRWRLASK GSITEVRYDW HVRVTKFWMN WLTPLLKPVF KWNHSVVMKQ GGKGLARYLD
ARFIGME
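
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation (the two differ in permitted start codons), and the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
FIRST_GENE_LINE = "ATGAGCGCGA GGCGTCAGCT TAGATTTTGG TTGATTGGGT TTTTGCTCTT TCTCATCTCG"
print(translate(FIRST_GENE_LINE))  # MSARRQLRFWLIGFLLFLIS
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` should work the same way, since `clean` strips the block spacing and line breaks first.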