Gene Noc_1770 details


Gene Information

Locus tag: Noc_1770
Symbol:
ID: 3704787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1993890
End bp: 1995107
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 56%
IMG OID: 637738253
Product: Sodium/dicarboxylate symporter
Protein accession: YP_343772
Protein GI: 77165247
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
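
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the gene record above.
start_bp = 1993890
end_bp = 1995107
gene_length_bp = 1218
protein_length_aa = 405


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))      # 1218, matches Gene Length
print(coding_codons(gene_length_bp))      # 405, matches Protein Length
```

Under these assumptions the record is internally consistent: 1995107 - 1993890 + 1 = 1218 bp, and 1218 / 3 = 406 codons, one of which is the stop codon.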


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.352949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCATC GCACAGCGCT TATACTCCTT GGATTGATTA TCGTTGGCGT GATCGCAGGC 
GTCCTGGCGG GATGGTACGC TGGCCCCAGG ATGGAAGCCG TAGCCTGGCT GGGCGCCCTA
TTCCTCAACG CCCTCAAGAT GACTATCATC CCCCTTATTC TATCCGCAGT GATTACCGGT
GTAGCCAGCT TAGGGGATGT CCGTAAACTA GGCCGGGTAG GTACGATCAC TATCGGCTAC
TATGCCTGCA CTACCGCTAT TGCGGTGGCT ATTGGCCTCC TCATGGTTAA TCTTATCCAA
CCCGGCAGCG GTATTTCCCT TGGAGAAGGG CCGATTCCAG AAGGGGTAGC CGCCAAGGGA
GAGATGGGCA TTGACGACAT TTTACTCTCT CTGGTCTCTC CTAATCTGGT CAACGCCGCG
GCTGAAGGAC AGCTCTTGCC TCTTATCGTC TTTGCTATTT TATTCTCTGC CACCCTCACC
ACCCTGGGAG ATAAGGGCCA GCCGGTGCTG GCCTTCTTTG AGGGCGTCAA CGAAGCCATG
ATGAAGTTAG TGGTATGGAT CATGTATCTG GCACCAGTGG GTATCTTCGC CCTGATTGCC
GCCCGCCTGG GACAAACCGG CGGCGGGGAA GCCTTTCTCG GGGAGGTCAG CGCCGTGGGC
TGGCATGTGG TGACGGTACT TTCCGGATTG GCCCTGCACT TTGGGGTGTT GCTGCTAATA
TTATTTTTTA TTACCGGCCG GGGTTGGGAT TACCTGTTCA CCATGCTCCG CGCCCTGCTG
ACCGCCTTTG GCACGGCCAG CTCCTCCGCC ACACTACCCC TCACCATGGA GTGCGTACGG
GAAAATGGCA TTGATCCCCG AGCCGTCCGT TTCGTCTTAC CCTTGGGTTC CACCATCAAT
ATGGATGGTA CAGCCCTATA CGAATCAGCG GCGGCGATGT TCATCGCCCA AGCTTATGGA
ATTTCTCTGG GCTTGGAACA ACAGGCCCTT ATCTTTGTAA CAGCCACTCT AGCCGCCATT
GGCGCCGCGG GCATTCCCGA GGCTGGTTTA GTCACCCTTG TCATTGTATT AAATGCCGTC
GGCCTCCCCC TTGAAGGGGT GGGACTGCTG CTCGCCGTTG ACTGGTTTCT AGATCGCTTT
CGCACCTCCA TCAATGTCTG GGGCGATTCA GTAGGAGCCG CCGTTTTGGC CCGCTTTTTA
CCTAATAACC CAAGCTAG
 
Protein sequence
MPHRTALILL GLIIVGVIAG VLAGWYAGPR MEAVAWLGAL FLNALKMTII PLILSAVITG 
VASLGDVRKL GRVGTITIGY YACTTAIAVA IGLLMVNLIQ PGSGISLGEG PIPEGVAAKG
EMGIDDILLS LVSPNLVNAA AEGQLLPLIV FAILFSATLT TLGDKGQPVL AFFEGVNEAM
MKLVVWIMYL APVGIFALIA ARLGQTGGGE AFLGEVSAVG WHVVTVLSGL ALHFGVLLLI
LFFITGRGWD YLFTMLRALL TAFGTASSSA TLPLTMECVR ENGIDPRAVR FVLPLGSTIN
MDGTALYESA AAMFIAQAYG ISLGLEQQAL IFVTATLAAI GAAGIPEAGL VTLVIVLNAV
GLPLEGVGLL LAVDWFLDRF RTSINVWGDS VGAAVLARFL PNNPS
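
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are allowed), strips the spacing used in the 10-base blocks, and stops at the first stop codon. The helper names `clean`, `translate` and `gc_percent` are hypothetical. Only the first 30 and last 18 bases of the gene are pasted in for brevity; the full sequence can be substituted.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 18 bases of the gene sequence listed above.
head = "ATGCCCCATC GCACAGCGCT TATACTCCTT"
tail = "CCTAATAACC CAAGCTAG"

print(translate(head))  # MPHRTALILL
print(translate(tail))  # PNNPS
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above. Calling `gc_percent` on the full gene sequence gives a value that can be compared with the GC content reported in the record.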