Gene Noc_2375 details


Gene Information

Locus tag: Noc_2375
Symbol:
ID: 3704815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2722653
End bp: 2723930
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 49%
IMG OID: 637738858
Product: hypothetical protein
Protein accession: YP_344363
Protein GI: 77165838
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
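
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2722653
end_bp = 2723930
gene_length_bp = 1278
protein_length_aa = 425

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1278 bp gives 426 codons, one of which is the stop codon, leaving 425 amino acids.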


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTTTA ATACTTCGCT TAGTGGTCTG AATGCCGCAT CGTCTGATTT GGATATAACC 
GCTAACAATA TTGCTAACGC GAGTACCACA GGTTTTAAGC GGTCCCGGGG TGAGTTCGCG
GATATCTTTG CCGTTTCGTC TTTTGGTCGC TCCCAGACAG CGGTGGGTAG CGGGGTTTTG
CTCAATAAAA CGGCCCAGCA GTTTGAGCAG GGTAACTTGG ATTTTACTCA AAACTCTCTG
GATCTGGCTA TCAGCGGCCG GGGACTTTTT GCCTTGTCGC CCAGTTTAAA TAGCGATGAT
CTAGTTTATT CCCGGGCAGG CGCCTTTAGT GTCGATAAGG AAGGTTATGT AGTGAATAGT
TCGGGCCAAT ACTTGCGGAC TTTTCCCACT AATGCAAATG GCACCGCCAC CGCTGCTAGT
TTGAGTAATT CCCATCCATT GCGACTTCCT ACCTCGGCAG GCACCCCCCA GGCAACTTCC
CAAGTCAGTA TTGGGGCCAA TTTACCTTCC GATGCCGCTG CCCTTGACCC GGCCAGTTTG
GATCTTTTAG ATGCCAGTAC CTATACTGCC TCTACTTCAG AAACGGTTTA TGATTCCTTA
GGCAATAGTC ATATCTCGAC TTTATATTTC CTCAAGGATA CCAATGGTAT TAACCAATGG
GCCGTTTATC ATTCCCTGGA TGGTACCCTC GCTAATATCA ATGGAGGAAC CGCTGGTGCC
GGTGGGATCC AGTATGGAAC CCTCAACTTT GACGCAACGG GTATTCTTAC CGGGTCAGTT
CCGGATCCTC TGATAACCGA TCCCCTCGCC CTCAATAATG GGGCGAACGA TATAGCTATG
AGGTTGGATT TTGCGGCTAA TAATACGACT CAAGTTGCTT CGCCATTTAA TGTGGCGGTC
CTAAACCAGG ACGGTTTTAC GTCGGGTCGT CTTACCGGCC TGGATATCAG CAATACAGGC
ATTATTCAGG CTAATTATAG CAACGGGCAA AATTCCACGC TAGGTAAGAT TGCCTTGGCT
CAGTTTCCTA ATGAGCAGGG CTTGCGCCAA TTGGGAAACA ATGCTTGGGC AGAAACAGTA
AGTTCGGGGA CCGCCTTGGC GGGAGAGGCG GGGGTCGGTA GCTTTGGTTT GATCCAAACG
GGAGCCCTGG AAAGCTCCAA CGTGGATCTC ACTGCGGAGT TGGTGCATTT GATTACCGCG
CAGCGTAATT TTCAGGCCAA TGCAAAGGCT ATCGAAACAG CTAGCACTGT TACTGATACG
ATTATCAATA TTCGTTAA
 
Protein sequence
MAFNTSLSGL NAASSDLDIT ANNIANASTT GFKRSRGEFA DIFAVSSFGR SQTAVGSGVL 
LNKTAQQFEQ GNLDFTQNSL DLAISGRGLF ALSPSLNSDD LVYSRAGAFS VDKEGYVVNS
SGQYLRTFPT NANGTATAAS LSNSHPLRLP TSAGTPQATS QVSIGANLPS DAAALDPASL
DLLDASTYTA STSETVYDSL GNSHISTLYF LKDTNGINQW AVYHSLDGTL ANINGGTAGA
GGIQYGTLNF DATGILTGSV PDPLITDPLA LNNGANDIAM RLDFAANNTT QVASPFNVAV
LNQDGFTSGR LTGLDISNTG IIQANYSNGQ NSTLGKIALA QFPNEQGLRQ LGNNAWAETV
SSGTALAGEA GVGSFGLIQT GALESSNVDL TAELVHLITA QRNFQANAKA IETASTVTDT
IINIR
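
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only: `CODON_TABLE` and `translate` are hypothetical names introduced here, and the codon table is built from the standard codon assignments, which translation table 11 shares for elongation codons (it differs from the standard table in its permitted start codons). Only the first and last blocks of the gene sequence are translated, so the expected peptides can be read off the protein sequence above.

```python
from itertools import product

# Codon assignments in T, C, A, G order (first, second, third base).
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to format the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGGCGTTTA ATACTTCGCT TAGTGGTCTG"
last_blocks = "ATTATCAATA TTCGTTAA"

print(translate(first_blocks))  # MAFNTSLSGL
print(translate(last_blocks))   # IINIR
```

The two printed peptides correspond to the first ten and last five residues of the protein sequence listed above; the trailing TAA is the stop codon and contributes no residue.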