Gene Snas_5036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5036 
Symbol 
ID8886243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5345185 
End bp5346363 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content71% 
IMG OID 
Productglutamate--cysteine ligase GCS2 
Protein accessionYP_003513766 
Protein GI291302488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.293374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGTA CGGCTCAGCA TGCTCACATA ACGCGCTTCA TCGACCCCGT TCCCCCCACC 
GCCGGTATCG AGGAGGAGTT CTTCCTCGTC GACCCCCACA CCCGCCGGGT CGCCCCCGAC
GCGGCGGAGG TGGTGCGGCG GGCCGGGACC TGGAGCGGCG GCTCCATCAG CACCGAGTTC
ACCAAATACC AGGTGGAGAC CCGCACCGAT CCGTTCTCCG ATGTGGACGA TCTGGCCACC
GAGGTGGCCC GGATGCGCGA CATCGCCGCC ACCGCCGCGA CCGAAGCGGG GCTGCGCGTC
ACCGCGACCG GCACCCCCGT GCAGGGCGAC ATCGTCCCGC CGCCGATCGC CGACATCCCC
CGCTACCGCG AGACCACCGC GATGTTCCGC ACACTGCAGG ACGGACAGAG CATCAGCGCC
TGCCACGTCC ACGTGCACAT GCCCGACCCG GAGCTGGCGG TACTGGTGAG CAATCACCTG
CGACCCTGGC TGCCGGTGCT TTTGTCCATG AACGGGAACT CGCCCTACTG GGCCGGACGC
GACACCGGCT ACGCCAGCTG GCGGACGCTT TCCTGGAGCG GCTGGCCGGT CGCCGGTCCG
CCGCCGTACT ACGAATCGCG CGACCACTTC GACGAACTCG TCGGCACCCT GGTGGCCGGT
GGCGCCCTGA TGGACCGTCG CTCGATCTTC TGGGACGTCC GCCCCTCGGC GCATCTGCCG
ACCATCGAGG TCCGGGTGGC CGACGTCGCC GCGACCGCCT TCGAGGGGCC GCTGTTCGCC
GCGCTGGTGC GTGCCCTGGT GACGCTGGCG GCCCAGGCGG TGCGGCACGG TGACCGGGGC
CCGAGGACCG CCCCGGAACT GTTGCGGGCC GCGAGCTGGC GCGCCGCGCG AGACGGCTTG
GAGGGCAAGG GAATCGACAC CCGCACCGGG AAACTGCGCG ACGCCGGGCA GCTGGTCGAG
TCGCTGCTGG CCGAGGTCCG TCCGATCCTG ACGGTCTGGG GAGAGTGGGA CCGGGTGACC
GGCTGGTGGC AACGGCTGCG GTCAATCGGC AGCGGTGCCG CCCGGCAGCG CGCGGCCTAC
GCCGAGCGCG GTCACCTGGA CGACGTCGTC GACTGCCTCA TCGAACAGAA CCGGCCCGGT
TCGCTCCACA AGGGGCGCAC CGTGGTCCAT AGTGGCTGA
 
Protein sequence
MAGTAQHAHI TRFIDPVPPT AGIEEEFFLV DPHTRRVAPD AAEVVRRAGT WSGGSISTEF 
TKYQVETRTD PFSDVDDLAT EVARMRDIAA TAATEAGLRV TATGTPVQGD IVPPPIADIP
RYRETTAMFR TLQDGQSISA CHVHVHMPDP ELAVLVSNHL RPWLPVLLSM NGNSPYWAGR
DTGYASWRTL SWSGWPVAGP PPYYESRDHF DELVGTLVAG GALMDRRSIF WDVRPSAHLP
TIEVRVADVA ATAFEGPLFA ALVRALVTLA AQAVRHGDRG PRTAPELLRA ASWRAARDGL
EGKGIDTRTG KLRDAGQLVE SLLAEVRPIL TVWGEWDRVT GWWQRLRSIG SGAARQRAAY
AERGHLDDVV DCLIEQNRPG SLHKGRTVVH SG