Gene Nmul_A1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1622 
Symbol 
ID3784090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1861572 
End bp1862705 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID637811711 
Producttransaldolase 
Protein accessionYP_412315 
Protein GI82702749 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0176] Transaldolase 
TIGRFAM ID[TIGR00876] transaldolase, mycobacterial type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.632626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCAT TGGAACAGCT ATTGCAATGC GGACAAGCCG TCTGGCTGGA TTCGATCAGT 
CGCGACCTCA TCAAGAGCGG GCAACTGCAG CGCCTGGTAA CCGGAGACAA GCTGCATGGC
CTGACAAGCA ATCCCACTAT TTTCGAGCAG GCGATCGGGC ACAGCGATGC TTACGACAAC
GCGCTGCGCC AGCTATTGCG AACCAATGAA AAACAAACCG AAAAGGCCCT GTTCGATGCG
CTGGCTATCG AGGATATTCG CATGGCCGCA GATGTGTTGC GGTCCGTATA TGATGAAACC
CATGGTGGGG ACGGGTACGT CAGCCTGGAG GTATCACCCC ACCTGGCACG CGATACCGGA
GGCAGTATCG CAGAAGCCAA GCGCTTATGG CAAGCCGTGG AGCGGCCCAA TCTCATGATT
AAAATCCCCG CTACTCCCGA GGGAATTCCA GCAATTGAGC AACTGATAAG CGAAGGCATC
AACGTCAATG TCACCCTGAT GTTCTCCCTG CGCCACTATG AGGCCGTGGC ACATGCATAC
ATTACGGGGC TTGAACGCCG TGATGCTTAT TCGCCCGGCG GAAACAAGAT ATGGCCCGTT
TCGGTCGCCT CTTTTTTTGT CAGCCGGGTG GATAACATAA TCGATCCCAT GCTGGAAAGG
ATCGGCACCC AGGAAGCGCT CGCCTTGCGC GGGAAAATTG CCATTGCCAA TGCCAAACTT
GCCTATCAAC GCTTCCGTGA GATATTTTAC GGAGAGCCAT TTGATTCCTG GCGCAAAAAA
GGTATACACG CCCAGCGGCC ATTATGGGCC AGCACCAGCA CAAAAAATCC TGCATATTCG
GATGTGTTGT ACGTCGAGGA ATTGGTCGGC CCCGACACCG TCAATACGAT GCCACTCAAA
ACGCTGGAAG CATTCCGGGA TCACGGGCGG ACTAGCAAAA CCCTTGGAAA AGGACTGGCA
CAAGCTGAGG CCGACGTGGC CCAGCTTAAG GAGCTGGGGA TCGATCTCAA TGCAGTTACC
GAAAAACTTC AAAATGACGG AGTCGATTCG TTCGCCGCAT CCTATGACAA GCTTCTTGCC
TCACTGAGGA AAAAGCGCCA GGAAATTCTC ACTACCAGCG ACCAGACAGC CTGA
 
Protein sequence
MNPLEQLLQC GQAVWLDSIS RDLIKSGQLQ RLVTGDKLHG LTSNPTIFEQ AIGHSDAYDN 
ALRQLLRTNE KQTEKALFDA LAIEDIRMAA DVLRSVYDET HGGDGYVSLE VSPHLARDTG
GSIAEAKRLW QAVERPNLMI KIPATPEGIP AIEQLISEGI NVNVTLMFSL RHYEAVAHAY
ITGLERRDAY SPGGNKIWPV SVASFFVSRV DNIIDPMLER IGTQEALALR GKIAIANAKL
AYQRFREIFY GEPFDSWRKK GIHAQRPLWA STSTKNPAYS DVLYVEELVG PDTVNTMPLK
TLEAFRDHGR TSKTLGKGLA QAEADVAQLK ELGIDLNAVT EKLQNDGVDS FAASYDKLLA
SLRKKRQEIL TTSDQTA