Gene Noc_2721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2721 
Symbol 
ID3704747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3090347 
End bp3091588 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content58% 
IMG OID637739203 
Producthypothetical protein 
Protein accessionYP_344704 
Protein GI77166179 
COG category[S] Function unknown 
COG ID[COG2995] Uncharacterized paraquat-inducible protein A 
TIGRFAM ID[TIGR00155] integral membrane protein, PqiA family 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTCT CTGCACAGGC CACGTTAGCC GTCTGCCATC ACTGCGACTG GGTGATGACC 
TTGCCACGAC TACGGGCGGG CGAGACCGCT TGCTGCCCCC GATGCGAACA CAAGTTACCT
GGCCAACAGC ACACTTCAAT TCAAAGCCAA CTTGCCTGGG CCAGCGCGGC ATTAATCATG
TTGGCAGCCG CGATCGCCTT TCCCTTCGTG AGCTTCGAGG TACAGGGGAT TAAACACACC
ATTATCGTGG CAGATACCGC ACTAGCCCTC TTTGACTACG ATTTTCCATT CTTGGGACTG
GTTGTGCTTA CGACCACGAT TTTGTTGCCA ACCGCCTATC TACTGGTCCT CCTTTATCTC
CATGGAGTGC TGGCTTCGGG CCGCCGTCCC ATGGGGGCGC AAACACTAGC GCGGCTACTG
ACAAGCATCA AACCTTGGGT GATGAGCGAT GTGTTTGTGG TCGGGGTGCT AGTGAGCATG
ATCAAGGTGC TCTCCCTGGC GAGTCTGCAG CTCGGTCCTG CATTCCCGGC GTTTTGCGCC
TATGCCGTGT TGCTGTTGAA GTCCATCTCT AGCTTCGATC CTGGAACCTT GTGGACGGCT
ATCAGTGGCC CGGTCGATCC TCCGGTTGAT CTTGCCCCCG GCAGTCCTGC GGCTACTCAA
GGAGCCGCAG GCTGCACCCG CTGCAATGCC ATCGTCAATA CTGCCAGCCA GACACGCTGC
CCACGTTGCG GCTACCATCC TATCGCGCCC AACCCGCGCC GCTTGCAAGC AACCTGGGCA
TTGCTCATCG CAGCAGGCAT CTTGTATATT CCAGCCATGG CCTATCCCAT CATGATCACC
ACCGAGCTCG GTAGAACCTC ACCACAGACC GTAGTGGGCG GCGCCCGGCT TTTGCTGGAG
ACCGGTTCCT GGCCCATCGC CCTGATAATC TTTACGGCGA GCATTGTGGT GCCTATCGGT
AAGGTGCTCG CTCTTGGCTG GCTTTGTTTG CAGGCACAAG CGGGTACCGG GCGCAGCGCC
TATGACCGAC TCAGGCTGTA CCGGCTTGTC GAGGCAATCG GCCGCTGGTC GTTCCTCGAC
GTATTTGTAG TTGCCCTGTT GACCGCCCTC ATTCAGGCAG GTGAGCTTAT GCGCGTACAG
CCCAGCCCCG GTGTGGTTAT CTTTGCAATT GTGGTGATCC TCACCATGCT GGCGGCAATG
GCCTTTGACC CCCGCCTGAT TTGGCGGGTC CATGAAAAAT GA
 
Protein sequence
MDFSAQATLA VCHHCDWVMT LPRLRAGETA CCPRCEHKLP GQQHTSIQSQ LAWASAALIM 
LAAAIAFPFV SFEVQGIKHT IIVADTALAL FDYDFPFLGL VVLTTTILLP TAYLLVLLYL
HGVLASGRRP MGAQTLARLL TSIKPWVMSD VFVVGVLVSM IKVLSLASLQ LGPAFPAFCA
YAVLLLKSIS SFDPGTLWTA ISGPVDPPVD LAPGSPAATQ GAAGCTRCNA IVNTASQTRC
PRCGYHPIAP NPRRLQATWA LLIAAGILYI PAMAYPIMIT TELGRTSPQT VVGGARLLLE
TGSWPIALII FTASIVVPIG KVLALGWLCL QAQAGTGRSA YDRLRLYRLV EAIGRWSFLD
VFVVALLTAL IQAGELMRVQ PSPGVVIFAI VVILTMLAAM AFDPRLIWRV HEK