Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1383 |
Symbol | |
ID | 8882570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 1464145 |
End bp | 1467225 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003510183 |
Protein GI | 291298905 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.980355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGACTA TGACGTTGTT GCACGTGGTA TTGCGGGGGA TCCGTTATCG TCCCGGGCGT TCCGTCGTCG TGCTGGCGCT GGCCGCCGTC GCGACCGCGG CCGCCGTCAT GACCCCGGCC TACACCTCCG CGGCGCACCA GTCGCTGTTG ACCGACGAGC TCACCGACCT GCCGCCCGGC TTCGACCGGG CCCGCGTCGC CGCCGAAGGT CACCGCGCCA AGCCCGACGC CGAACACTTC GACGACATCT ATGAATCCAT CGACGCGAAG GTGTCGGGTT CGCAAGAGCT GTCGCCGTTG GTCGACCCGG TGCGGTACGT GTCGACCAAC GTCTCGGTGT CGAAGGTCGA CTGGCCCGCC TTCTCACAGG TGATCTACCG CGAGGGCCTG TGCGAGCGAC TGCGGATGGT CGAGGGGAAG TGCACCACCG GCAAAGGCGA GGTGCTGATC TCCAAGCGCT CGGCCGCGTT CGAGGACGCC GAGGTCGGCG ACACCATCGA ACTGGGGACC CGGCGGGTCG CCGAGGCCGG GACCGCCGAG GTCACCATCG CCGGGATCTA CACACCGCGC GACGCCTCCT CGGACTACTG GGGACTGGTC AATCCGTTCG CCCACGCGAT CGGCCCGGAC GACTACAACC TCGACGCCCT GTTCATGACC GACCCGGAAT CGACGACCGC CGTCGGACGT CCGGTGGAGG TCGGGGTGGA GTACACCCTC GACATTCCTG CGATCCGGCT GGCCAACGTC GGACAGCTGA AGGCGGAACT CGCCACGATC GACAAGAGCG GCAGTTTCGC GGGCCGGACG GCGACGACCG AATCCCGGAT ATCGGGTGTG CTCAAGGACG TCGCGCAATA CGAGGCGTCG GTCGCGGCCT CGGTGCCGCT GGTGACGATT CCGCTGCTGG TGCTGTCGTG GTTCGTGCTG TACCTGGTGG TGGCCCGGCT CAGCGAGGAG CGGGGCCCGC AGATCGCGCT GGCCAAACTG CGCGGGCACC GGTTCACGGC CGTCACCGGG TTCGGCACCG GCGAGGCCAT GGTGCTCATC GTGGCGGGGG CGCCGGTGGG CGCGCTGCTG GGCTGGCTGC TGGTGCAGAC GGTGGCCTGG CAGGCGCTGG CCAAGGACGC AGCACCCACC CTCACCGTCG ACATCGTCGT GTACGTCGCG GTCTCGCTGC TGGGTGCCTG GGCCGCGGCG CTGGCGGCGG CGCGACCGGT GCTCAACCGC CCGGTGCTGA CCCTGCTGCG CCGGGTCCCG GCGCGGGGGA ACTGGCGGGC CGGGCTGCTG GAGGGCGCGC TGGTGGCGCT GGCCGCCGTC GCGGTGTGGC AGGTGCTGTC GGCTTCCGAC GCGGGCGCCA TCGGCCTGCT GGCGACGCCG CTGGTCGCGG TCGTGGTGGG AGTGGCCGTC GCGCGGCTGC TGGAACAGCT GGCCCGGCGA CGACTGCCGG TCGCGCTGCG CAAGGCGAAA CTGGCCCGGA TGCTGGCGCT GGCGCAGGTT TCCCGGCGAC CGGAGACCCG CCGGATGGTC GTGCTGGTGA CCGTGGCCGC GGCGGTGGTG ACCTTCGGCG TGTGCGCCTG GGACGTCTCC GAACACAACC GGGAACTGAC CGCCTCGGAC GAGATCGGCG CCGACAGGGT GTACACCCTG TCCGCCGGTG ACCCGCAGAC CGTCATGGAC ACGGTGGAGA AGCTCGACCC CGAGGGCGAC GAGCTGATGG CCGCGATGCG CCGCGTCGAC CGCTATGACA ACCAGGACTT CTCCACCATC GCGGTGCAGT CCGACCGGCT GTCCAAGGTG GCCAGCTGGC GGGACATGTC GCCCGCGCGG CTGGGCGATC TGGCCGGTCA GCTGCATCCC AGGCGTCCGG AACCGTTGCG GGTCAAGGAG GAGATCTCCG TCAAGGCCGA CGTCGACGAC TTCGACGCCG CGCGGAAACC GTCGCTGGTC GCCAAGGTCG TACCCGACGG CGCGGACCCG GTGCTGCTGC GGCTGGGCAA CCTGTCGAAG GGCTCGGACA CCTACGAGGT GTCGGCCCCC GAATGCGATG ACGGCTGCCG CCTCATCGGT GTCGGCGTCT CCCCGCACCC CGCGGACTTC GAGGGGCTGT CGGCGAAACT GACGGTCTCC GAGATCTCCG ACGCCGACGG CGAAGTCGAC GCCTCACTGT CCGAATGCGA CAACTGGCAG CCCATCGGTG ACCTGCCCGC CGACACGCCG CTGGAACTCG ACTGCTCCGA GGGCTTGCGC ATCTCGGTCG ACAGCGACGA CTCGCACGAC TTCCTGGCCG AGTACGCCAG CAACCCCTTG GTGCTCCCGG CGGCGGTGGC CGGAAGCATC CCGGCCGCCG AAGTGGACGG AGACGAGTTC GCGAGCATGG GCCCGCAGCG GGATCTGCAA CGCTACCAAC GCGTCCAGGC CGTCACCGTG GTCCCGCGCG GCGGCGTCCG CGCGATGATG GTGGACCTGG AGTACACCAA CCTGGCCGCG CAGAACTACA CCGCGATCAA GGACCAGGAG GAGGTCACCT TCGAGGTCTG GGCCAACGCC GCCGCCGACC CGGATCTGGC CGCCCGCCTC GCCGAGGAGG GCCTGGTGGT CAAGGCGTCC GAGAGCCGCG ACACGCTGCT GGAGCGGATG TCGCGCTCGG GGCCCGCGCT GTCGCTGCGG CTGTACCTGG TGGCCGCGAT CCTGGCTCTG CTGCTGGTCA TCGGCGCGGT GCTGCTGTCG TCCACCGTGG GCGCGGCGGT GCGCGGCTAC GACAACGCCG CCCTGGCCGT GTCCGGTGTG ACGCGACGGC AGCTGCGAGC CGCGTCCATC AGGGAGCATC TCTATTGGAT CGTCTTCCCC GCGGCGGCCG GGATCGGTGC CGGGTTCGCC GGTCTGGCGC TGGTGCTGCC CAGCGTGCCG CTGGTGAGCG ACGAGGCGCC CGAGGTGGCG ATGGCCTACG AGCTGCGGCC GCTGCTGCCC GGCGGCGCGC TGGCGTTGAT GGCCTGCGCC TTCGCCGCCC TGGTGTGGCT GGCGGTCCGG CTGGGACGGC GCCGAGGCTC GGCACGACGA CTTCGCGACA GCGTGAGCTA G
|
Protein sequence | MRTMTLLHVV LRGIRYRPGR SVVVLALAAV ATAAAVMTPA YTSAAHQSLL TDELTDLPPG FDRARVAAEG HRAKPDAEHF DDIYESIDAK VSGSQELSPL VDPVRYVSTN VSVSKVDWPA FSQVIYREGL CERLRMVEGK CTTGKGEVLI SKRSAAFEDA EVGDTIELGT RRVAEAGTAE VTIAGIYTPR DASSDYWGLV NPFAHAIGPD DYNLDALFMT DPESTTAVGR PVEVGVEYTL DIPAIRLANV GQLKAELATI DKSGSFAGRT ATTESRISGV LKDVAQYEAS VAASVPLVTI PLLVLSWFVL YLVVARLSEE RGPQIALAKL RGHRFTAVTG FGTGEAMVLI VAGAPVGALL GWLLVQTVAW QALAKDAAPT LTVDIVVYVA VSLLGAWAAA LAAARPVLNR PVLTLLRRVP ARGNWRAGLL EGALVALAAV AVWQVLSASD AGAIGLLATP LVAVVVGVAV ARLLEQLARR RLPVALRKAK LARMLALAQV SRRPETRRMV VLVTVAAAVV TFGVCAWDVS EHNRELTASD EIGADRVYTL SAGDPQTVMD TVEKLDPEGD ELMAAMRRVD RYDNQDFSTI AVQSDRLSKV ASWRDMSPAR LGDLAGQLHP RRPEPLRVKE EISVKADVDD FDAARKPSLV AKVVPDGADP VLLRLGNLSK GSDTYEVSAP ECDDGCRLIG VGVSPHPADF EGLSAKLTVS EISDADGEVD ASLSECDNWQ PIGDLPADTP LELDCSEGLR ISVDSDDSHD FLAEYASNPL VLPAAVAGSI PAAEVDGDEF ASMGPQRDLQ RYQRVQAVTV VPRGGVRAMM VDLEYTNLAA QNYTAIKDQE EVTFEVWANA AADPDLAARL AEEGLVVKAS ESRDTLLERM SRSGPALSLR LYLVAAILAL LLVIGAVLLS STVGAAVRGY DNAALAVSGV TRRQLRAASI REHLYWIVFP AAAGIGAGFA GLALVLPSVP LVSDEAPEVA MAYELRPLLP GGALALMACA FAALVWLAVR LGRRRGSARR LRDSVS
|
| |