Gene Nmul_A0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0402 
Symbol 
ID3785395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp445010 
End bp446086 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content54% 
IMG OID637810478 
Productpermease YjgP/YjgQ 
Protein accessionYP_411102 
Protein GI82701536 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0017509 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAATAA TAGAACGATA TATCACGCGG GAACTGCTGA TACCGTTTAT GGTAGTCACC 
GTAATCCTTG CGACATTGTT TGCAAGTTTC AGTATGGCCC GTTTTCTTGC CGGAGCAGTG
ACAGATTCGC TTGGCCTTAT CCCCGTGCTC AGGCTCGTGT TTCTGAAAAC GCTGATCGCA
CTGGAAGTGC TGATGCCCAT TGCCCTGTAC GTGGCTGTCA TCATGGGACT GGGTCGCCTG
CACCGGGATC AGGAAATAGT CGTTTTGCGT TCCGCTGGGG TCAGCGAACA CCGCATTATC
TATGCGGTGC TCATCGTTGC GATTCCTATG GGGCTTATCA GTGGTTTATT TTCCATTTTT
GCGCGTCCAT GGGCGTATGA GGAAAGCTAC CTACTCAATG CCCAGGCAGA GGCAGAGTTG
AATACGGATC GGTTCCGCGC CGGGCGTTTC TACGGCAGCG AAAAAAAAGG CCGGGTGATT
TATGTGCAGG CCAAGGATAG CTCGGGCAAG CAGATGGGAG AGGTATTCCA CTATCTGAAC
AAGCATGACA GCAGCGAGAT CATTCTTGCC AAGAAAGCTC ACCAGCCTGA GCTTGTGTTC
GGCCAGCGCC CCCAGATACA TCTGCTGGAT GGCTCCATTT ACCGGCTATC GCACACCGGA
AAAGGCGATA CCGTCGTCCA GTTTGAAAAG CTGGTTTATT TCACGGACAG CGGAAACGTA
ACGGATTACA GGCGCAAGGC TGCCTCTACC GCGGCATTGA TGCAATCTGA TCAGCCGCGG
GATACTGCCG AGCTTCAGTG GCGGCTGTCG CGCCCGCTGG CAACGATCCT GCTGGCGCTG
ATAGCGGTGC CCTTCAGCCG CGCTTCACCC CGCCAGACAA AGGGAGATAA GACTTATTAT
CTGGCAGCTC TGGTTTTCGC CATTTACTAC ATTTTGAGCG GATTGGCCCA GACTTGGGTC
GAGCAGGGCA CGATCGGGAG GGTGCCGGGT GTGTGGTGGC TCTATGCTGT CATGCTGCTG
TTTGCAATCT CGTTATTATC GCCTGGTTTC TGGCGGAAGT TGCCTTTGCG CAGATGA
 
Protein sequence
MKIIERYITR ELLIPFMVVT VILATLFASF SMARFLAGAV TDSLGLIPVL RLVFLKTLIA 
LEVLMPIALY VAVIMGLGRL HRDQEIVVLR SAGVSEHRII YAVLIVAIPM GLISGLFSIF
ARPWAYEESY LLNAQAEAEL NTDRFRAGRF YGSEKKGRVI YVQAKDSSGK QMGEVFHYLN
KHDSSEIILA KKAHQPELVF GQRPQIHLLD GSIYRLSHTG KGDTVVQFEK LVYFTDSGNV
TDYRRKAAST AALMQSDQPR DTAELQWRLS RPLATILLAL IAVPFSRASP RQTKGDKTYY
LAALVFAIYY ILSGLAQTWV EQGTIGRVPG VWWLYAVMLL FAISLLSPGF WRKLPLRR