Gene Nmul_A0249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0249 
Symbol 
ID3785735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp266239 
End bp267717 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content49% 
IMG OID637810324 
ProductO-antigen polymerase 
Protein accessionYP_410949 
Protein GI82701383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGACAG TCATTATTCT TTCGATAGCT GTCTTTGTTT TTGCGCTTGG TTCAATCGTT 
GCAATGAGCG CTGTGGCAGG CGCAGCTTCC CGCTGGCCAG GTCAAATGTT CAGCTATCTG
ATGTGGATGA TAGTGATGGC AGGTGTGGGC GGTATCCTCG TTTCAGGGCG TATTCTCAGA
ATAGACGAAG AAGCTTTGGT TATGGGAGCA GTGGGCGAAG CTGGGGGCAC GATAATTGCC
AAATTGCTCC TCTCCGCTGT AATCGGGGTT TCCCTCGCTT TATGCGTGAC ATGGATATTG
TTGTCGAGCA AAAGGAAGGC AGGGGGTAAC CGTTTTGAAC AAAGGAGACT GAATTCGCCA
AATGATATCG TGATTGCGTT TATGGTGTTC TATATCGCGT TCAGCATTCT GCCGCTCATC
TTTGGAAAGA GCCATCAATT TCACGTATCA CTGTTTTATC CATTTTTCGT GTTTATCGCC
TTATTTCTGT GGATGCGGTT GTCGAAGGTT GACCCGGTTA TCGTGGCGAA GCAGTGCCTG
GGATTTATTG TATTGACTAG TTTGGTCATG GCCGTGCTAA TTCCCCAGCT GGCCGTTCAG
CCCAGCTATG TAGGACTTAT CCCCGGATTC AATTCGCGAC TTTGGGGGTT AACGGCAGGA
GCAAACTCTC TTGGATCAGT CGCTGGCACT CTTCTCGTGC TGGAGGCCGC CGAGCCATCC
GCCAGGAGAT GGCTCGGCAA TGGAATTTTT TTCACTGCTG CCCTAGCCTT GGTTTTAACT
CAATCCAAGA CCTCCATTCT GGCAGCGTTC CTGGGGCTTT TGATCATTTT TGGATATCGA
CTGGTGACCG GGCTTCAAGG AAAAAGCTTG AACGGACGTA ATGAAAATTT AATTTTAATT
ATTCTAATAG CGTTTTTTAT TTTGTTTATC ACGGCAGTCA GTGCGTGGGT GATGTTCTTT
GATACAAGCG TCTTTACTTC ACTTGAACGC AGCCTGGATT CGCGAGCGGT TAGCAAATTG
GCAACGGCAA GCGGACGAAC CTGGATATGG GAAGTTGCTT TGCGAGGAGG AATGGAGAAT
CCTCTGTTCG GACAGGGTTT AGGCTTCTGG AGCTTGGAGA ATCGGCTTCG GTGGGGGCTG
GGGGGTGCCG TACATGCCCA TAATTTGTTT CTTGACGTGT TTGCCCGTTC TGGATTTGTG
GGCTTGAGCA CACTATTGGT TTTTCTCTAT TTTGTTTTTC GCTACTCCGT ACGCGCGACC
CGGTACACGC ATGGAGGCAG CATTGCGTTG GCAGTCATCT TTCTCGTTCG AGCGACGTTT
GAAGTGCCAC TTCAACCAAA TGCCATTCTA GGAGCGGAAT CCATGGCAAT GTTGGCTTTT
TTCCTCTATG TAATCGATAG AGGAGCCAAA CAGCGCGACA AAGCCAATGA GCCCGTCCAA
GTGCGGGCAC ATTTTTTAAG AGCAGGAAAC TTCCGATGA
 
Protein sequence
MLTVIILSIA VFVFALGSIV AMSAVAGAAS RWPGQMFSYL MWMIVMAGVG GILVSGRILR 
IDEEALVMGA VGEAGGTIIA KLLLSAVIGV SLALCVTWIL LSSKRKAGGN RFEQRRLNSP
NDIVIAFMVF YIAFSILPLI FGKSHQFHVS LFYPFFVFIA LFLWMRLSKV DPVIVAKQCL
GFIVLTSLVM AVLIPQLAVQ PSYVGLIPGF NSRLWGLTAG ANSLGSVAGT LLVLEAAEPS
ARRWLGNGIF FTAALALVLT QSKTSILAAF LGLLIIFGYR LVTGLQGKSL NGRNENLILI
ILIAFFILFI TAVSAWVMFF DTSVFTSLER SLDSRAVSKL ATASGRTWIW EVALRGGMEN
PLFGQGLGFW SLENRLRWGL GGAVHAHNLF LDVFARSGFV GLSTLLVFLY FVFRYSVRAT
RYTHGGSIAL AVIFLVRATF EVPLQPNAIL GAESMAMLAF FLYVIDRGAK QRDKANEPVQ
VRAHFLRAGN FR