Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0743 |
Symbol | |
ID | 3786567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 866878 |
End bp | 867939 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810825 |
Product | hypothetical protein |
Protein accession | YP_411442 |
Protein GI | 82701876 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.890074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTCTTGA GTACCGGATT GTGCTTTGTT TCCCCCGCCT TTGCACAGCT ACAACGTTTA TACATTCTGG ATCTCAAAAG AAACCAACTG ATCACTCTTA TGCCGAGTGA AATGACGGGT AACGCTACCG GCATCAATAA TGCTGGCCAG GTAGCATGGT ACGCCATCAA GGAATATGGG CACGCTTTCA TCACCGGCCC CGACGGCGTT GGCAGAACCG ATCTCAAGAC ACTGGGCGGT GATTCCAGCG CAACCTTTGG CATTAATGAT GCGGGGCAAG TGGTAGGATG CTCACAGACA GCTACAGGTA ATTTGCATGC TTTTATTACC GGCCCCAACG GCGTGGGCAT GACAGACCTG GGGACACTGG GAGAACGTGC CAGTTACGCC ACCAGCATCA ATAATGCTGG CCAGGTGGCA GGATATACCG TCAAGGAATA TGGGCACGCT TTCATCACTG GTCCCAATGG TGCGGGCATG ACCCATCTGG AAAGCCTGCC GGACGGACTC ACAACTGTTG CTCATGATAT CAATGATGCC GGACAGGTGG TCGGGAGGGG CGTGAGGCAC GCTTTCATCA CTGGCCGCAA TGGCGTGGGG ATGAAGGATC TCGGGACCCT GGGTGGAGAT TACAGTGTCG CCTATGGCAT CAATGAGGCC GGAGAGGTGG TAGGGGGTTC CAGCACGGCT GCCGGTTATA CGCACGCTTT CATCACTGGT CCCAATGGCG TGGGGATGAC GGATCTCGGG ACGCTGGGTG GAGGTTACAG TATCGCCACT GACATCAACG ATGCCGGACA GGTGGTGGGA TGGTCCACCA TGGCTGCCGG TGATAACCAT GCCTTTATCA CCGGTCCCAA TGGCGTAGGC ATGATGGACC TTAATTCACT GCCCTGGTTA CCGCCCGGAT ACGTTATAAC GGGCGCCATC AGCATCAATG ACAGGGGACA GGTCGTCGTT ATTGCCGATC TTCCCAAACC CGAGGCTTAT ATGCTGATAT TCGCTGGCCT GGGCCTGATC GGGTTTCTGG TATGGTGGCA GAAGTCGGGA AGCAGCGCCT AG
|
Protein sequence | MVLSTGLCFV SPAFAQLQRL YILDLKRNQL ITLMPSEMTG NATGINNAGQ VAWYAIKEYG HAFITGPDGV GRTDLKTLGG DSSATFGIND AGQVVGCSQT ATGNLHAFIT GPNGVGMTDL GTLGERASYA TSINNAGQVA GYTVKEYGHA FITGPNGAGM THLESLPDGL TTVAHDINDA GQVVGRGVRH AFITGRNGVG MKDLGTLGGD YSVAYGINEA GEVVGGSSTA AGYTHAFITG PNGVGMTDLG TLGGGYSIAT DINDAGQVVG WSTMAAGDNH AFITGPNGVG MMDLNSLPWL PPGYVITGAI SINDRGQVVV IADLPKPEAY MLIFAGLGLI GFLVWWQKSG SSA
|
| |