Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_2963 |
Symbol | |
ID | 8884162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3125966 |
End bp | 3128329 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003511731 |
Protein GI | 291300453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0080512 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000919293 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACGG AACTGCACCA CCCGACGCGA CAGCGCACCG CGGTCGCCGC CGCCATCGAC GGCGTCGCCG TCGTCATGCC CGCCTACGGC GAGGAGGCCA ACCTCGCCGC GACCGTCACC GACTTCCTGG ACACACTGGA CGACGCCAGG ATCCCGCACG CCGTGGTCGT GGTCAACGAC GGCAGCACCG ACCGCACCGG CGACGTCCTC GACTCACTGG CCAGACGCTA TCCCGGCCGG GTCGTGGCCG TGCACCACGA CCGCAACCGC GGCTACGGCG CGGCGGTGCG TTCCGGCCTG GACGCCGCGT TGCGGCACAC CGAACTGGGC CAGATCCTGC TCACCGACTC CGACCGGCAG TTCCACGCCG CCGATCTGCT GGAGTTGCGA CGCCGCAAGG CCACCGAACG CGCCGACGCC ATCCTGGGTT ACCGGGAGCG CCGCGCCGAC CCGTGGCATC GCCGCCTCAA CGCCCGAGTG TGGACACTGC TGTGCAAGAC GCTGCTGCGG CTGCCCGGCC GCGACGTCGA CTGCGCCTAC AAACTCATCG ACCGCCGCCT GCTGGAGGAA CTGAGCCTCA CCGGTGAGGC CGCGGCGATC TCACCCGAAC TGGTCAGCCG GATCACCGTG GACGGCAACC GGGTTGTGGA ACACCCGGTG CGGCACTATC CGCGCACCCA CGGCGAACAG ACCGGCGCGC GGCTGTCGGT GGTGGCGCGG TCGCTGCTGA GCCTGGCCGG GGTCTACGCC AGACTCGTCC GCGACGCCCA CCGGCTGGTG TGGCTGCGAC GGCTGGCACG TCCGGCCAAC CCGACCGCCG CCGTCGTCAC CATCCTGGCG GCACTGCTGG CGGCCGCCGC CTATGTGTTC TATCTGGACC AGGGCGTTTT ACTGGCCTAC AAGGACGCCA ACTCGCACCT GCTGATCGCG CGGCGCGTCG TGGCCAGTCC CACCGCCGGA CTGGCCCAAC TGGGCGGGGT GTGGCTGCCG CTGCCGCACC TGCTGTCGGC GCCGCTGGCG GCCTGGGAAT CCTGGTACCT CAACGGTTTC GCCGGGGCCC TGATCTCGAT GATCTCGTTC GTGCTGTGCG TGCGCTACCT GTACCTGCTG GGGGAGACGC TGACCCGCAC CCGGCTGGGC GGCCTGGTCA CGGCGGCGCT GTTCGCGGGC AACGCCAACA TCCTGTACCT CCAGGCCACC GCGATGACCG AACTGCCGCT GTTCGCGTGC CTGGCCGCCG CGACTTACCA CCTCGACCAA TGGTGCCGGG CCGCCCGCAG CCGAGACCTG GCGGCCGCCT CGGTGGCGGT GCTGGCCGCG ACCCTGACCC GCTACGAGGG CTGGGTGTTC TGCGCCGCGG CACTGGTGAT CGTGATCTAT GTGGAGCTGC GGCGGCACCG CAGCTACTCC AGGACCGAGT CCTCACTGGT CATCTTCGGA TTCCTGGCCT GCGCGGGAAT CGCGGCCTGG CTGGTGTGGA ACCTGGTGAT CTTCGCCGAC CCGCTGTACT GGCAGCGCGG CGAGTACGCC GAGTCCTCCC TGTGGGTGGA CGGCAACGAC GTCAACGTCG GCGAACTGGA CATCGCCTCG GGCACCTACG GACTGGCGAC GTACCACAAC ATCGGGCTCG TCACGCTCGC GATCGCGGCG GCGGGGATGC TGCTGTACCT GATCAGACAC CGGATCCGAC GCGAATGGGT GGCGATCTAC CCGCTGCTGG GCTTCGGACC GTTCTTCGTG GTCGCGCTGT ACACCGGGCA GCGACCGCTC AACGTCGTGG AGTACACCGG CCACTTCTAC AACGTCCGCT TCGGACTCAT CATGCTGCTG CCCGCCGCCG TCTTCGCCGG TTACCTGATC TCCGTGGGCG TCAGCGCGGT ACGGCGGTTC CGGTTCCGGT CACTGCGGTG GACGGCGGTG GCCGTCGCCG TCCTGGTAAC CGCGTTCGGC ATCGCCACCG TGCCGGGGGT GGTCACCCTT CGCGACGCGG TGGAGTTCCG CGAGACCGAC TGGGAGAACG CCGCGACCGC GCGCTGGCTG CGCGACAACC ACGACGGCGA CCTGACCCTC ATGATGTCGT TCGCGAACGA GTCGGTCACC TACGACTCCC GCATCCCCAC CGAATCGCTG GTCTACGAGG GAACCTACCG GCTGTGGGAA CGCGCACTGG CCGACCCGCA CGCGCAGGGC ATCGAATGGA TCTACATGAG AGCCCTTCCC GACGCCGAGG ACCTGGTGTG GCACGCGCTG TACGACACGC CGCAACTGGA GGACCACTAC ACGCTGGTCT ACGAGTTCCA CGACCGGCTG GTGTTCCGGT CCACAAAGGC ACTGACCGAG GAGTCGGAGG GCGAGGATGA CTGA
|
Protein sequence | MTTELHHPTR QRTAVAAAID GVAVVMPAYG EEANLAATVT DFLDTLDDAR IPHAVVVVND GSTDRTGDVL DSLARRYPGR VVAVHHDRNR GYGAAVRSGL DAALRHTELG QILLTDSDRQ FHAADLLELR RRKATERADA ILGYRERRAD PWHRRLNARV WTLLCKTLLR LPGRDVDCAY KLIDRRLLEE LSLTGEAAAI SPELVSRITV DGNRVVEHPV RHYPRTHGEQ TGARLSVVAR SLLSLAGVYA RLVRDAHRLV WLRRLARPAN PTAAVVTILA ALLAAAAYVF YLDQGVLLAY KDANSHLLIA RRVVASPTAG LAQLGGVWLP LPHLLSAPLA AWESWYLNGF AGALISMISF VLCVRYLYLL GETLTRTRLG GLVTAALFAG NANILYLQAT AMTELPLFAC LAAATYHLDQ WCRAARSRDL AAASVAVLAA TLTRYEGWVF CAAALVIVIY VELRRHRSYS RTESSLVIFG FLACAGIAAW LVWNLVIFAD PLYWQRGEYA ESSLWVDGND VNVGELDIAS GTYGLATYHN IGLVTLAIAA AGMLLYLIRH RIRREWVAIY PLLGFGPFFV VALYTGQRPL NVVEYTGHFY NVRFGLIMLL PAAVFAGYLI SVGVSAVRRF RFRSLRWTAV AVAVLVTAFG IATVPGVVTL RDAVEFRETD WENAATARWL RDNHDGDLTL MMSFANESVT YDSRIPTESL VYEGTYRLWE RALADPHAQG IEWIYMRALP DAEDLVWHAL YDTPQLEDHY TLVYEFHDRL VFRSTKALTE ESEGEDD
|
| |