Gene Snas_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1096 
Symbol 
ID8882281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1166714 
End bp1168264 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content70% 
IMG OID 
ProductGMP synthase large subunit 
Protein accessionYP_003509899 
Protein GI291298621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0602354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATGC CTCGCCCGGT TCTGGTTGTG GACTACGGTG CCCAGTACGC GCAGCTCATC 
GCCAGGCGGG TGCGCGAAGC GCACGTGTAT TCGGAGATCG TGCCGTCCTC GATCTCCACT
GAGGAGCTTC TGGCCAAACG GCCGCTCGCG GTGATCCTGT CAGGCGGTCC CTCCAGTGTG
TACGCCGACG GGACGCCGAA TCCGGACCCG CAGCTGTTCG AGTCGGGCGT GCCGGTCATG
GGCATCTGCT ACGGCTTCCA GGCCATGGCC GTGGCACTGG GCGGAACCGT CGAGGCCACC
GGCCAGCGCG AGTTCGGCGG CACCAGCCTC ACCGTCACGG GCCAGGGCGC CGTGTTCAGC
GGGCTGCCGG AGCGGCAGTC GGTGTGGATG AGCCACACCG ACCGGGTCTC GGCCGCGCCC
GAGGGCTTCA CCGTCACCGC CAGCACCGAC GTCACCCCCG TCGCGGCGTT CGAGAACGTG
GAGCGGCGGA TGGCGGGCGT CCAGTTCCAC CCGGAGGTGG CGCACACGCC GCACGGGCAG
GCGGTGCTGC GGCACTTCCT GTACGACATC GCCGGTATCG AACCCAACTG GACGATGAGC
AGCGTCATCG ACGAGCAGGT CGAGGCGATC CGCGCCCAGG TCGGCGACAA GCGGGTGCTG
TGCGGGCTGT CGGGCGGCGT CGACTCCTCG GTGGCCGCCG CACTGGTGCA CCGCGCCGTC
GGGGACCAGC TGACCTGCGT CTTCGTCGAC CACGGGCTGC TGCGTTCGGG TGAGGCCGAG
CAGGTCGAGA AGGACTACGT CGCCGTCACC GGCATCAACC TCAAGGTCGT CGACGCCCAG
GAGACCTTCC TCAAGGAACT GTCCGGTGTG GTTGATCCGG AGTCGAAGCG CAAGATCATC
GGCCGCGAGT TCATCCGCGC CTTCGAGAAC GCCGCCCGCG AGATCGACGC CGAGGGACCC
ATCGAGTACC TGGTCCAGGG CACCCTCTAC CCGGACGTGG TGGAGTCCGG CGGCGGCACC
GGCACCGCCA ACATCAAGTC GCACCACAAC GTCGGCGGCC TGCCCGAGGA CCTCAAGTTC
ACCCTCGTGG AACCGCTGCG CACCCTGTTC AAGGACGAGG TCCGCGCCAT CGGCACCGAA
CTCGGACTGC CCGAGTCCAT GGTGTGGCGC CACCCCTTCC CGGGCCCCGG CCTGGGCATC
CGCATCATCG GCGAGGTCAC CGCCGAGCGG CTGGAGATCC TGCGCGCCGC CGACGCCGTG
GCCCGCGCCG AACTGACCGC CGCCGGCCTC GACCGCTCGG TCTGGCAGTT CCCGGTGGTC
CTGCTGGCCG ACGTCCGCTC CGTCGGCGTC GCCGGTGACG GCCGCACCTA CGGCCACCCG
ATCGTGTTGC GTCCGGTCTC CAGCGAGGAC GCGATGACCG CCGACTGGTC CCGCCTGCCC
TACGAGGTCC TCGCCCGCAT CTCCACCCGC ATCACCAACG AGGTCCCCGA GGTCAACCGC
GTCACCCTCG ACATCACCAG CAAACCCCCG GGCACCATCG AGTGGGAGTG A
 
Protein sequence
MTMPRPVLVV DYGAQYAQLI ARRVREAHVY SEIVPSSIST EELLAKRPLA VILSGGPSSV 
YADGTPNPDP QLFESGVPVM GICYGFQAMA VALGGTVEAT GQREFGGTSL TVTGQGAVFS
GLPERQSVWM SHTDRVSAAP EGFTVTASTD VTPVAAFENV ERRMAGVQFH PEVAHTPHGQ
AVLRHFLYDI AGIEPNWTMS SVIDEQVEAI RAQVGDKRVL CGLSGGVDSS VAAALVHRAV
GDQLTCVFVD HGLLRSGEAE QVEKDYVAVT GINLKVVDAQ ETFLKELSGV VDPESKRKII
GREFIRAFEN AAREIDAEGP IEYLVQGTLY PDVVESGGGT GTANIKSHHN VGGLPEDLKF
TLVEPLRTLF KDEVRAIGTE LGLPESMVWR HPFPGPGLGI RIIGEVTAER LEILRAADAV
ARAELTAAGL DRSVWQFPVV LLADVRSVGV AGDGRTYGHP IVLRPVSSED AMTADWSRLP
YEVLARISTR ITNEVPEVNR VTLDITSKPP GTIEWE