Gene Noc_1381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1381 
Symbol 
ID3706106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1530369 
End bp1531523 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content54% 
IMG OID637737876 
Producthypothetical protein 
Protein accessionYP_343405 
Protein GI77164880 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCAG ATTCCCAAGA ATCGTCGCCC TCCTCCGCAG TAGCCGGACC CATTGATGTC 
CGCAGCACCG CGCTGGCGCT GCTCGCTCTA TTTGCCACGA TCCTGATGTT ACAGTGGACC
CAGGCAGTAT TGGTGCCATT GGTGTTCAGC ATATTAGTAA GCTATTCCCT GGACCCCATA
GTCAGCGCCC TGGAACGGCT TAAAGTGCCC CGCTGGCTGG GGGCAACGCT ACTGGTAATG
CTGTTTATAG GACTCCTGGG CTATGGTAGC TATACCCTTA GAGACCAAGC TATGGTGCTT
CTGGATAAGA TCCCCCAAGC GGTGCAAACG CTACGCCATT CCATGCAAGT TAAACCCGCC
GACTCCCGTG AAGGAGTCAT CAAAAAAGTT CAGGAGGCCG CAGAAAAAAT ACAAGAAGCA
ACTAAATCCG CAGATGACAA CGCTACCTCT AGCAAGCCGG GAGTCATGAA GGTGGAAATC
GTGGAGCCAG GACTCAAGCT GGGAGAATAT GTCTGGTGGG GTTCACTGGG GGTTTTGGCC
TTCCTCGCGC AGTTGGCCAC AGTCGTCATG CTGGTGCTGT TGTTCTTGGT TTCTGGCGAT
ACCTACAAAC GCAAGCTGGT GAAGATCACA GGCCCCACCC TCTCCGAAAA AAAGGTGACG
GTGCAAATCC TGGATGATAT TAACATGCAG ATCCGCCGCC ATTTATTTGT GCTGGTGATT
TCAGGCGTTT TCGTGGGGCT AGCCACTTGG GGCGCCTTCC TATGGATTGG GCTAGAACAA
GCAGCGCTTT GGGGACTCAT TGCCGGGGTA GCAAGCACGG TGCCCTACCT GGGGCCGGCC
GTGGTGTTCG CCGCCACCAC AATCGTGGCC CTGGTGCAAT TTGGCACGAT CACGATGGGA
CTTGGGGTAG GCGCTGCCTC TCTCCTCATT ACTGGCATTC AAGGCAACTG GCTTACCCCT
TGGTTGACCA GCCGCACCTC CAGCATCAAT GCCGTTGTGG TCTTTATAGG TCTCCTATTT
TGGGGCTGGC TATGGGGACC TATCGGACTC ATTGTCGCCA CCCCCATTCT GATCATCATC
AAAGTTTGCT GCGATCATGT GGAAAATCTA ACGTCCCTCG GCGAACTGAT GGGAAAAGGC
TCCTATGAAG ATTAA
 
Protein sequence
MPSDSQESSP SSAVAGPIDV RSTALALLAL FATILMLQWT QAVLVPLVFS ILVSYSLDPI 
VSALERLKVP RWLGATLLVM LFIGLLGYGS YTLRDQAMVL LDKIPQAVQT LRHSMQVKPA
DSREGVIKKV QEAAEKIQEA TKSADDNATS SKPGVMKVEI VEPGLKLGEY VWWGSLGVLA
FLAQLATVVM LVLLFLVSGD TYKRKLVKIT GPTLSEKKVT VQILDDINMQ IRRHLFVLVI
SGVFVGLATW GAFLWIGLEQ AALWGLIAGV ASTVPYLGPA VVFAATTIVA LVQFGTITMG
LGVGAASLLI TGIQGNWLTP WLTSRTSSIN AVVVFIGLLF WGWLWGPIGL IVATPILIII
KVCCDHVENL TSLGELMGKG SYED