Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_0477 |
Symbol | |
ID | 8881660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 497403 |
End bp | 498803 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | FG-GAP repeat protein |
Protein accession | YP_003509285 |
Protein GI | 291298007 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00503985 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAAAT CACGTTTGTC CCCCCGTGCC CGCAAGGCGG GGGTGATCAC ACTGGCCGTA GCGGTCGGCG TGACGACGGT GGGAGCAGCG GCCGTGGCGT ACGCCGACCC CGCCGAATCC TCCGTGCCCA CGTCGTCGGA CTTCGACGGC GACGGCAAGG ACGACCTGGC CATGTCGGCG CAGAAGACCG ACGAAGCCGC CGAGGACTCG GTCGTCATCG ACTACACCAC CGGTCTGGCG AACAAGGAGC TGTACCCGGA GTCGGCCTAC GGCACCGACG GTTTCGGGGT GGGACTGGCC GCGGGGGACC TCAACGGCGA CGGCTTCGAC GACCTCGCGG TCGGCTGCGT CAACTGTGAC TGGGAATGGG GCGGCGCGAC CGTCTCCATC TACAACGGCT CCGCCGAGGG CCTCAAACCC GACTCGGCGG TCAACGCCGA GGTCGGCGAC CCGACCTACG CCGTCGGCAT CGGTGAACTC AACGGCGGGG GAAGCCTGGA CGTCGGCTCG ACTCGGCTGG GCGACGCCAG CGCGGTCTCC TCGCGCGGCG ACGACGGCTG GTGGTCCGAC AAGTGGGTCA ACACCGGGAT GCCGACCGAT GAGAACCGGC TGGGCTCGGT GGCCATCGGC GACGTCAACG GCGACGGCAA GGACGACCTG GTCATCGGCA CCCCCACCGC CGACGGTGGT TCGATCACGC TGTTCCCCGG CCCGGTGACC GAGGGCAAGA AGGACACCGT CAAGGCCGTC GAGCTCAGCC CGACGCTGCG CGACCTGGGC GCCTCGCTGG CCGTCACCGA CGTGACCGGC GACGGTCTGG CCGACGTCAT CGCCGGGGCC CCGACCTCGA CGGTCGGCGG CACCAGCTGC GGCGCGGTCC AGCTGCTGAT CGGCAAGACC AACGGCATTG CCGCCGACTT CAGTCAGCGG CTCACCCAGG AGAGCGCCAA CATCCCGGGT GTCTGCGAAG CCGGTGACGA CTGGGGCCGT TCGGTGGCCG CGGGCAACGT CGACGGTGAC GCCGGCGCCG AGGTCGTGGT CGGGGTCCCC GGCGAGGGCA TCGACTCGCT GGGCAAGGCC GGTACCTACA CCACGCTTCA GTCCACTTCG ACCGGTCTGA CCGGCACCGG TTCGTTCGGG GTCTCGCAGG CCACCGCCAA CGTCCCGGGA ACCGCGGAGT CCGGTGACGG CTTCGCCTCC GCGCTGGCGC TGCGGGACGT CAACGACGAC GGCCGCATGG ACGTCGTCAT CGGTGCCCCC ACCGAGGACG TCTCCACCGT CAAGGACGCC GGACAGGTCG TCACGGCGCT GTCCAGCGCC ACCGGCGCGC CCGCCGCGGG CACCACCGAG GTGACCGGCA ACAAGTACGG GCTCAAGCGA TTGGGCTGGG AACTGGCGTA G
|
Protein sequence | MRKSRLSPRA RKAGVITLAV AVGVTTVGAA AVAYADPAES SVPTSSDFDG DGKDDLAMSA QKTDEAAEDS VVIDYTTGLA NKELYPESAY GTDGFGVGLA AGDLNGDGFD DLAVGCVNCD WEWGGATVSI YNGSAEGLKP DSAVNAEVGD PTYAVGIGEL NGGGSLDVGS TRLGDASAVS SRGDDGWWSD KWVNTGMPTD ENRLGSVAIG DVNGDGKDDL VIGTPTADGG SITLFPGPVT EGKKDTVKAV ELSPTLRDLG ASLAVTDVTG DGLADVIAGA PTSTVGGTSC GAVQLLIGKT NGIAADFSQR LTQESANIPG VCEAGDDWGR SVAAGNVDGD AGAEVVVGVP GEGIDSLGKA GTYTTLQSTS TGLTGTGSFG VSQATANVPG TAESGDGFAS ALALRDVNDD GRMDVVIGAP TEDVSTVKDA GQVVTALSSA TGAPAAGTTE VTGNKYGLKR LGWELA
|
| |