Gene Information |
Locus tag | Nmul_A2109 |
Symbol | |
ID | 3784680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2406689 |
End bp | 2407903 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637812197 |
Product | hypothetical protein |
Protein accession | YP_412794 |
Protein GI | 82703228 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
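The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and makes two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2406689
end_bp = 2407903
reported_gene_length = 1215      # bp
reported_protein_length = 404    # aa


def gene_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)
```

Under those assumptions both computed values agree with the reported Gene Length and Protein Length fields.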
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.657946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC TTCGCGTCCC TCATCGCACC AAGTCTGCGC GCGTGGTTCA GGAGCTGGTC CGCCTGGCTG CGGGACTTGC CGCCTCTGGC AGCCGCGTAG AAGACGCCTT CTGGGAGAAT TGCCTTACCG CAAAAATACA TGCCCAATTG CGAACGGGAG ACGATCAGAA TCTCGAATCT GCACTGGATC AGCTTTATGA CACTGATCCT GGAGGCTACA GTGAATTCAT GTATGCCATC GAAGCCGCAA TAGAATGCAG TATATTCACG CGAGGGAATC ACACTTACGA CGTTTTGATG CTTGCTGTAC CGATGCTCAC CTGGTCGCGT TTTTCGATTC CCTCGGGACT AATAGCAGAA TCCGTGCTTT CTGAACTTCG GGGGCAGTTG CAGGTGCAGG TGTTGGCGGA CGATGCCCAG CTTGCGCTTG CAGATTGTCT TTTCAGCCCC GATCAATTGC CAAGAGGCTA TCATCTTACA CATGAACTGG CTCGCAAACT CTGGTCGACG GCGGTGACCG GACAACGCGA CTTGCATATG AACCCACGGC AGCTCCCTGA AACGGGACGA TTTCTTTCTG ATACCCGTTA TCTCCTCGGC GGGATCGCGG TACCTCAGGG AAAACCCATG TTCCGCTGGC AGCAAGGCAC TGCGGGCGAA CACAAGGGCA GCAACGTTTC CGCTCTCCAG TTCTCCAAAG AGCAGATACT GCATGCATGG CAAACTCACG GTACTGCGGT GCTGCTGCCG CTATTCCAGG GCTGTGCGTT TGAGCTGCTG ATGCCGGATG GCTTTTTTTC AGCATGGCGC GCCGCGGATC GTCTGGCGCG CCCCTATTCG GTGCGCGCCA CGGTTGATTT TCTGGAAACC ACTTTGGGTA TTTCCCCCGA CAGATTGCGT GCAGTGATCG CGCCTTTCTA TGATCAGTGG CTGGAGGAAT ACCGCATTGG ATTCACCCTC AAGGATCGGG ACAGCGTGCT CTACGGCATC ATCTGGGGAC TGGTAGGGGA TGAGGATGAA AATACGGATA GTGTCGCTCA GATCGAGGCA GCGCTGCGGG AATGCGGAGT GACACAAACC ATACTACTGC AGGAGCATTT TCTTTTGGAG TACTGCGAAG AATGCGGGGG ACCCCTGTTT CCCAACGTGA ATGGTGAAAT CGTGCATGCA GAATTCCCGG AAGAAGGTGA AGTGGCCCCC ATACACCTGC ATTGA
|
Protein sequence | MKKLRVPHRT KSARVVQELV RLAAGLAASG SRVEDAFWEN CLTAKIHAQL RTGDDQNLES ALDQLYDTDP GGYSEFMYAI EAAIECSIFT RGNHTYDVLM LAVPMLTWSR FSIPSGLIAE SVLSELRGQL QVQVLADDAQ LALADCLFSP DQLPRGYHLT HELARKLWST AVTGQRDLHM NPRQLPETGR FLSDTRYLLG GIAVPQGKPM FRWQQGTAGE HKGSNVSALQ FSKEQILHAW QTHGTAVLLP LFQGCAFELL MPDGFFSAWR AADRLARPYS VRATVDFLET TLGISPDRLR AVIAPFYDQW LEEYRIGFTL KDRDSVLYGI IWGLVGDEDE NTDSVAQIEA ALRECGVTQT ILLQEHFLLE YCEECGGPLF PNVNGEIVHA EFPEEGEVAP IHLH
|
| |
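The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the spacing, computes a GC fraction, and translates the first and last few codons of the gene to compare them with the ends of the listed protein. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons), and all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
gene_head = clean("ATGAAAAAAC TTCGCGTCCC TCATCGCACC")
gene_tail = clean("ATACACCTGC ATTGA")

print(translate(gene_head))  # compare with the start of the protein sequence
print(translate(gene_tail))  # compare with the end of the protein sequence
print(gc_fraction(gene_head))
```

Passing the full cleaned gene sequence to `gc_fraction` and `translate` gives values to compare against the GC content field and the complete protein sequence.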