Gene Snas_5240 details


Gene Information

Locus tag: Snas_5240
Symbol:
ID: 8886449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5562960
End bp: 5564183
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003513967
Protein GI: 291302689
COG category:
COG ID:
TIGRFAM ID:
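
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustration, not part of the record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5562960, 5564183)
print(length)                     # 1224
print(protein_length_aa(length))  # 407
```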


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.111977
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.759757
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACG AGACGCAACG CCAGCAGAAC CCCGACGAGG CACGACGGCA GCAACAGAAC 
TCCGAGGAGA CGCAGCGGCG GCAGGACCCC GAGAAGACGC AACGGCAGCA GGACCCCGAC
GCGGCACAGC GGCAGGTCCC AGACGAGGGG CTGCGGCAGC AGAATTCCGA CGCGGCGCAG
CGGCAGCAGA TTCTCGACGA GGCGCTGCTT CGGCTGCACA CCACCGGACC CGAGTTCGAC
GACTGGTTGA GCAACCACGC GCCGATGGCC GTTGAGGCGT TGGCCCATCA CGGGCACGCC
GATCGGGTTC ACCACTGGAT CGATGAGTAT GAGCACGTGT TGGAGGAGGC GCCGCGGCCG
TCTGATCCGA TCACCGTCGA GAATTGGCGC GAGGCGCTTG GGGATCCCCG GCGGCTTGGG
GACTGGCCTT CGTGGTTCGA CAACGAGCTG GCCGAGGCCG AGTGGACCGA GGTGTTGGCC
CGGTGGTGGC CGCGGTTGTT GCCCGGGATC GTCGCCAGTG CCACCCATGG GGTGATTCGG
GTCGGCCATG CCGTGCGCAC GTTGCGGGAG CAGGGGCCCA ACGAGGCGCG GTTGGCGGAG
TTGGCGCGGG CCTTCGGGTA CTGGGCGGCG CGGTGGCAGC CCATGGTGCG GCCGACGGCG
CCGTCCGGTG GGCTGGACGC CGCGAGTGCG CTCGCGCGGG TGCCCCGGAT TCCCGAGCAG
GACGGTGGGA TCCGGGATCG GCTGGCGCAG TTGGACGGGC TGGCGGGCTG GGAGTCGGCG
CAGGCCGCGC TGCGGTTGCC CGGCGAGGCC GAGGCAGTGC CGGACGCCGT GCGCGCGATC
GTGACCGCGG CGGTGAACCG GTACCTGAGC CACGGGCACG GTTCGGCGGT GATGCTGGTG
CACGCCGCGA CCGCCCCCAA CGCGGTGCTG CGGGTGTTGC CGTCGCTGCC AAGGGAGCAT
TGGCACGACA GTGCCGCGTT CGCGTGGTCG GCTTCGGCGG CGGTGATGTC GATCTACGCT
CCCGCCGAGG CAGCGCCGAC AACGGAACTG CCGCAGGCTC CGGACGGTTC CGGCGCGGAG
GAGGAGATCT TCGACGCGGC GGCGGCCAAC GGTGACGCGC ACGTGATCAA GTTCGCCGAC
ACCGCGCTGG ACGTGCGCTC CTGGACCGGG GACGCGACGC CGTTGGCGGC CGCGCTACGG
TCGGCACAGC TGATCGGGGA TTGA
 
Protein sequence
MTDETQRQQN PDEARRQQQN SEETQRRQDP EKTQRQQDPD AAQRQVPDEG LRQQNSDAAQ 
RQQILDEALL RLHTTGPEFD DWLSNHAPMA VEALAHHGHA DRVHHWIDEY EHVLEEAPRP
SDPITVENWR EALGDPRRLG DWPSWFDNEL AEAEWTEVLA RWWPRLLPGI VASATHGVIR
VGHAVRTLRE QGPNEARLAE LARAFGYWAA RWQPMVRPTA PSGGLDAASA LARVPRIPEQ
DGGIRDRLAQ LDGLAGWESA QAALRLPGEA EAVPDAVRAI VTAAVNRYLS HGHGSAVMLV
HAATAPNAVL RVLPSLPREH WHDSAAFAWS ASAAVMSIYA PAEAAPTTEL PQAPDGSGAE
EEIFDAAAAN GDAHVIKFAD TALDVRSWTG DATPLAAALR SAQLIGD
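
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of joining the blocks and computing a GC fraction. It assumes the GC content field is simply the share of G and C bases in the gene sequence; the record itself does not state how that value was calculated.

```python
def join_blocks(lines):
    """Strip all whitespace from block-formatted sequence lines and concatenate them."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Usage on the first two blocks of the gene sequence only, as a small example.
sample = join_blocks(["ATGACTGACG AGACGCAACG "])
print(sample)               # ATGACTGACGAGACGCAACG
print(gc_fraction(sample))  # 0.55
```

Passing all lines of the gene sequence to `join_blocks` should give a string whose length matches the Gene Length field.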