Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_5666 |
Symbol | |
ID | 8886881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 6022544 |
End bp | 6024253 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003514389 |
Protein GI | 291303111 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.13145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAGA ATTTGTCGCG CTTCGTGATG CGCACGGTGC TGCCCGGCGC ACTTCCGACG GCCGCGGTCG TCGTGTTCGC GCTCACTGAC CTGCCCTGGC TGGGCTATGG GCTGTGCCTG GCGACGCTGG TCTTCCTGAC CGGCCTGTTC ACCACCGTCA GCGCGGGTGC CGGACGCACC GCCCGCGCCG TCATCGTCGC CGCCGTGCTT TTGGGCACCG GGCTGTTCCG CGACGGCGAA CTGAACCTGG GACTGGCCGG GGCCGGGGCG CTGCTGCTGG GCCTGCTGGG CGTGGAACCG CTGGTGTTCA AGGCATTGCG GATGGGCAGG CTCAACACCG CGAACCTGCC GCTGCCGCGT TCCACCGTGG CGCGCTGGGC CACGCCGCGC ACCGTCGCGA TGGTCAACAT CGGCATGATC GCCGCCTTCA CCGGCTGCGC GGCCTTCCAG CTGTCGGGCT GGCCGCTGGT CGCCGTCGGC GTGCTGGTCG TGTTGACCCA GGCGGTGCTG GTGCTGCGGG TGTGGGTGCG GCGCCGCGAC GTCGCGTACC AGACCGACAC CGCGGTGCGC GCCGCCGTCG AGGCGCACGC GCCCAAGTTC GCGGTGCACT TCTCCGGCCC GGGCTCGGCG GTCTACCAGC TGCTGATGTG GCTGCCGTAC TTCGACCAGC TGGGCGACCC GTACGTGATC ATCCTGCGCG AGGGCCGCAC CGCCAAGACC TTCGCCGCCG CCACCCCGGC CCCGATCGTG GTGGCCCCGT CCATCGCGGC CATGGAGAGC ATGCTGGTGC CCAGCCTGCG CTCCGTCTTC TACGTCAACA ACAGCATGAA GAACACCCAC TGCGTCCGCT TCGGCGAACT GACCCACATC CAGCTGATGC ACGGTGACAG CGAGAAACCC GCCAGCCGCA ACCCGGTCAG CGCCATGTAC GACCGGGTGT TCGTGGCCGG GCAGGCCGGG GTGGAACGCT ACCGCCGCTA CGGCGTCAAG ATCACCGACG AGCAGTTCCG CATCGTCGGC CGCCCGCAGG TCGCCGACGT GCGCGTTGAC CGCGAACCCA TCGCCAACAA GAAACACCCG ACGGTGCTGT ACGCGCCGAC CTGGACCGGC GACTCCGCCG ACGTCAACTT CTCCTCGCTG CCGCTGGGCG AGGCGCTCGT GACCGAGCTG CTGGCCCGGG GCGCGACCGT CCTGATGCGC GAACACCCCT TCACCCGTCG CAACGTCGCC GCCGGCCGGG CCCTGGAGCG GGTCCAGGAA CTGCTGGCCA CCGACCGCGC CAAGACCGGG CGCCCGCACC GCTTCGGGGC CGAGACCTCC GGCGAGATCA CGCTGGCCGA CTGCTTCAAC GACGCCGACG CGCTGGTGTC GGACGTCAGC GGCGTCGTGT CCGACTGGCT GTACTCGGAG AAGCCGTTCG CCGTGACCGA CATGCTCGCC GAGGGCGAGG AGTTCGCCGA GAGCTTCCCG CTGTCGACCG CGTCATATGT GATCCGTCAC GACGGCTCCA ACATCCCCGA GGTGGTCTCG CAGCTGCTGG ACGCCGACCC GCGCGCGAGC GAGCGGCGGG CGCTGAAGAC CCACTACCTG GGCGACTTCC CCGCCGACAC CTACACCAGG GCGTTCCTGT CGGCGGCCCG CGAGACCTAC GAGACCCCGC GTGATCCGCG CGAGTCACAC GAGCGCGACC AGGCAGTGAC GGCGAGCTGA
|
Protein sequence | MPKNLSRFVM RTVLPGALPT AAVVVFALTD LPWLGYGLCL ATLVFLTGLF TTVSAGAGRT ARAVIVAAVL LGTGLFRDGE LNLGLAGAGA LLLGLLGVEP LVFKALRMGR LNTANLPLPR STVARWATPR TVAMVNIGMI AAFTGCAAFQ LSGWPLVAVG VLVVLTQAVL VLRVWVRRRD VAYQTDTAVR AAVEAHAPKF AVHFSGPGSA VYQLLMWLPY FDQLGDPYVI ILREGRTAKT FAAATPAPIV VAPSIAAMES MLVPSLRSVF YVNNSMKNTH CVRFGELTHI QLMHGDSEKP ASRNPVSAMY DRVFVAGQAG VERYRRYGVK ITDEQFRIVG RPQVADVRVD REPIANKKHP TVLYAPTWTG DSADVNFSSL PLGEALVTEL LARGATVLMR EHPFTRRNVA AGRALERVQE LLATDRAKTG RPHRFGAETS GEITLADCFN DADALVSDVS GVVSDWLYSE KPFAVTDMLA EGEEFAESFP LSTASYVIRH DGSNIPEVVS QLLDADPRAS ERRALKTHYL GDFPADTYTR AFLSAARETY ETPRDPRESH ERDQAVTAS
|
| |