Gene Nwi_2765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2765 
Symbol 
ID3675104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp3006796 
End bp3009021 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content63% 
IMG OID637714332 
Productmalate synthase G 
Protein accessionYP_319370 
Protein GI75676949 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01345] malate synthase G 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGAA TCGACGCCCA CGGACTGAAG ATCGCGCCTG TCCTGTTCGA TTTTATCGCC 
GGGGAAGCCA CGCTCCGGAC CGGGATCAAA CCCGACGCGT TCTGGGCCGG ACTTGCAGCG
ATCGTGCGCG ATCTCGGACC GAGAAACCGC GAGCTGCTCG CGGTGCGCGA CAGGTTGCAG
GCCAGGATCG ACGACTGGCA TCGCATTCAT AAAGACACGC CGTTCGACAT CAGCGCCTAC
ACAGCGTTCC TGACCGAGAT CGGCTATCTG GAGCCGGAGC CGGAGACGCG ACAGATCGAG
ACCGCCAATG TCGATGACGA GATCGGCAAA ATCTGCGGCC CGCAACTGGT CGTGCCGCTA
ACCAACGCGC GCTACGCGTT GAACGCCGCC AACGCGCGCT GGGGCAGCCT CTATGATGCG
CTGTACGGCA CCGACGCCAT CGCACGCGAG GCCGGCGAGG CGAAGGGATA CGACAAGGCG
CGCGGCGACA AGGTGATCGC GAGGGCAAAG GCCTTCCTCG ATATGGCAAC CCCGCTTGCG
AACGGCGCTC ATGCCGACGT CGCAGGCTAC AGCGTGCTCA ATGGACGGTT GTCGGTCACG
CTCAGGAACG GCGACACCAC TGGTCTGAAG GCGGAAACCC AGTTCGCGGG CTATCAGGGT
GATGCGAACG CTCCATCCGC GATCCTGCTC GTCAATCACG GTCTGCATAT CGACATCCAG
ATCGACCGCG CCCACCCCAT CGGCAGGCAT TATTCCGCCG GCGTCGCCGA CGTGATCATC
GAGGCGGCCA TCAGCACCAT TCTCGACATG GAGGACAGCG TCGCCGCCGT CGATGCCGAC
GACAAGGTCC TGGTGTATCG CAATGCGCTC GGCCTGATGA ACGGCACCCT CACCGACACT
TTTGAAAAGA GCGGCAAAAC CCTCACCCGC GCCCTCAACC CCGACCGCGT CTACACCGCG
CCGGACGGCG GGCAACTCAC GCTGCACGGA CGCAGCCTGC TTTTGATCCG CAACGTCGGT
CACCACATGT TCACCGACGC GGTTCTCGAC CAGCGCGGCG CCGAGATTCC CGAGGGTCTG
CTCGACGCGG CCATATCGGG CCTTCTCGCG ATTCACGACA TCAACGGGGT TTCGAAAAGG
CGGAATAGCC GCAGCGGTTC GCTCTACCTG GTCAAGCCCA AGATGCACGG CCCCGACGAG
GTCGCGCTGA CCTGCGAACT GTTCGCCCGT GTCGAGGCGA TGCTCGGACT GCGGGAGAAC
ACGCTCAAGA TCGGCATCAT GGACGAGGAA CGCCGCACCA CGGTCAACCT CAATGCCTGC
ATCCAGACCG CATCGAAACG CGTCTGCTTC ATCAATACAG GCTTCCTCGA CCGCACCGGC
GACGAGATTC ACACCTCGAT GGAAGCCGGT CCGATGATCC GCAAAGGCGA GATGAAGACG
CAGCCATGGA TCAAGGCCTA TGAGGACTGG AATGTCGACA TCGGCCTGAT CGAAGGCCTG
CCCGGCCGCG CCCAGATCGG CAAGGGCATG TGGGCCGCGC CCGACCGGAT GGCGGACATG
CTGGCGCAGA AAATCAACCA CCCGGAAGCG GGCGCCACCA CCGCATGGGT GCCGTCGCCA
ACCGCGGCGA CGCTGCACGC CTTGCACTAC CACCAGATCG ACGTGCGGAA ACGCCAGCAG
GAACTGAAGG CCGGCGGGCG ACGCGCCAGG CTGTCCGATC TTCTCACGAT CCCGGTGTCT
CATTCGAACT GGCCGCCCGA AGACGTGAAA CAGGAAATCG ACAACAACTG TCAGGGTATT
CTCGGCTACG TGGTGCGCTG GATCGATCAG GGCGTCGGCT GTTCGAAGGT GCCTGATATC
CACGATGTCG GCCTGATGGA GGATCGCGCC ACGCTGCGGA TTTCCAGCCA GCATCTGGCG
AACTGGCTGC ATCATGGCGT CGTTACCCAC GATCAGGTGA TGGATTCGTT CAAACGCATG
GCTGTCATCG TGGATGAACA GAACGCGGGC GATCCGCTCT ACAAGCCGAT GGCGCCGGGC
TTTGACAGCG TGGCTTTCAA GGCGGCCTGT GATCTGGTGT TCAGGGGCCG GGAGCAACCG
AACGGCTATA CCGAACACAT CCTGACCGAG AGGCGGCGCG AGGCCAAGCT TATGGCTCAG
GCGGCTCGAG CATTTTCCGG CGAAGCGGAT ACCGGTTCGC CGCGGGAAAA TGCGACCAGA
CAATAG
 
Protein sequence
MNRIDAHGLK IAPVLFDFIA GEATLRTGIK PDAFWAGLAA IVRDLGPRNR ELLAVRDRLQ 
ARIDDWHRIH KDTPFDISAY TAFLTEIGYL EPEPETRQIE TANVDDEIGK ICGPQLVVPL
TNARYALNAA NARWGSLYDA LYGTDAIARE AGEAKGYDKA RGDKVIARAK AFLDMATPLA
NGAHADVAGY SVLNGRLSVT LRNGDTTGLK AETQFAGYQG DANAPSAILL VNHGLHIDIQ
IDRAHPIGRH YSAGVADVII EAAISTILDM EDSVAAVDAD DKVLVYRNAL GLMNGTLTDT
FEKSGKTLTR ALNPDRVYTA PDGGQLTLHG RSLLLIRNVG HHMFTDAVLD QRGAEIPEGL
LDAAISGLLA IHDINGVSKR RNSRSGSLYL VKPKMHGPDE VALTCELFAR VEAMLGLREN
TLKIGIMDEE RRTTVNLNAC IQTASKRVCF INTGFLDRTG DEIHTSMEAG PMIRKGEMKT
QPWIKAYEDW NVDIGLIEGL PGRAQIGKGM WAAPDRMADM LAQKINHPEA GATTAWVPSP
TAATLHALHY HQIDVRKRQQ ELKAGGRRAR LSDLLTIPVS HSNWPPEDVK QEIDNNCQGI
LGYVVRWIDQ GVGCSKVPDI HDVGLMEDRA TLRISSQHLA NWLHHGVVTH DQVMDSFKRM
AVIVDEQNAG DPLYKPMAPG FDSVAFKAAC DLVFRGREQP NGYTEHILTE RRREAKLMAQ
AARAFSGEAD TGSPRENATR Q