Gene Snas_4820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4820 
Symbol 
ID8886027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5118545 
End bp5119744 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003513554 
Protein GI291302276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.906768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGC AGGTCTTCGC CGACGCGATC GGCAAGTCCA AGAGCTGGGT GGACAAAGTA 
GAACGGGGAG TGCGCACACT AGACAAGTAC TCGGTGCTCA ACGAGATCGC CGACGCGCTC
GCGATCGACG CGCAACTGCT GCTCGGCCGG GACACCCCGC GTCGCGACGA TCGCTACAAC
TGCATCGACC AGTTGGAGGT CAACAGCATC CGGGCCTCGC TGGAGCGCTA CGAACGACTG
GGCCGGTACC TGGGCGCCGC CGCGCTGGAA CCCGTCTCGG TCTCCGAGTT GCGCAAGTCG
GTGAAACACG CGTGGTTCGC CTTCGAGGGC GCGAACTATC CGGTGCTGGC GCGCACCCTC
ATCGAACTGC TGAAGTCCGC GCCGATCGCC GAGGAACACG CCCCCGAACA CGAGAAGGCC
GAGGCGGCCG GACTGCTGAC GCAGGTGTAC CAGATCGCCT CCTCGGTGCT GCGCAAACTC
GGTGAGGCCC AACTGTCCTG GCTGGCCGCC GACCGGGCCA TCGGGGCGGC GCAACGCTGC
GACGACCCGC TGCTCATCGG CATCGCCACC ATGCGGGTCG GCAACGCGCT GCGGTCCCTG
GGCCGCCACC AGGCCGCCCT CGACCTCAAC GTCCAGGTCG CGCACGGACT GATGACCGAG
ATCGGCTCGG CCGCGCGCGC CCAACCCGAA GCCCTGAGCG TCTACGGAAT CCTGCTGTTG
CAGGGCGCCA TGGCCGCCTC GCTGTCGGGG GACACCGCGA CCACGCGGGA CCTGCTGAAC
TCGGCGGGCC GGGCCGCCAG CCGGGTGGGG CCGGGCGTCA ACCACTACTG GACCTCCTTC
GGGCCCATGA ACGTCGAACT GCACCGGGCC GCCTCGGCGG TGGAACTGGG CGACGGCAGG
CTGGCGCTGC AGATCCACGA GCGACTGGAC CGGGCCGAGC TGGAGGGGCT GGTGCCCGAA
CGCCGGGCCC ACCACTACCT CGACCTGGCC CGGGGCTGCG CCCAGATCGG CGAGTTCGAC
AAGGCGGGCC GGGCACTGGT GGCCGCCGAT CGCAACGCCC CCAGCGAGAT CCGGTGCCGC
CCCATCGCCC ACGAGGTCAT CTCCGACGTG CTGCGCCGCA CCCGGGGCTC GGCCCCGCTG
CCGGTGCGGC AGCTCGCCGA CCGCATGGGC ATCGCCGCGT TATGCCGTTC CAGCCGGTGA
 
Protein sequence
MSQQVFADAI GKSKSWVDKV ERGVRTLDKY SVLNEIADAL AIDAQLLLGR DTPRRDDRYN 
CIDQLEVNSI RASLERYERL GRYLGAAALE PVSVSELRKS VKHAWFAFEG ANYPVLARTL
IELLKSAPIA EEHAPEHEKA EAAGLLTQVY QIASSVLRKL GEAQLSWLAA DRAIGAAQRC
DDPLLIGIAT MRVGNALRSL GRHQAALDLN VQVAHGLMTE IGSAARAQPE ALSVYGILLL
QGAMAASLSG DTATTRDLLN SAGRAASRVG PGVNHYWTSF GPMNVELHRA ASAVELGDGR
LALQIHERLD RAELEGLVPE RRAHHYLDLA RGCAQIGEFD KAGRALVAAD RNAPSEIRCR
PIAHEVISDV LRRTRGSAPL PVRQLADRMG IAALCRSSR