Gene Snas_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1406 
Symbol 
ID8882593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1489707 
End bp1490918 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content74% 
IMG OID 
ProductPHP domain-containing protein 
Protein accessionYP_003510206 
Protein GI291298928 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.407213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCCG AGACCACACT GACCGGTGTC TGGACGCCCG GCGATCGGGC CGAGGGCTTC 
TACCGGTACC TGCCGTTCGA GGTGCCCACC GGGACGAACG CGGTGCGGGT GCGGCTGGAG
TACCCGCGCG CGGGCGGCGT GCTGGACCTG GGTTGCTTTG GCGCGGAAGG CTTTCGGGGC
TGGTCGGGCG GCGCCCGCGA CCGCTGTGAG ATCGGCGAGT CGGCCGCGAC CCCCGGCTAC
CTGCCGGGCG AACTCGAAAC CGGACAGTGG AATGTGGTGC TCGGTCTGCA CCGGGTGCCG
CAACCGCTGG AGTACACCGT CACGATCGCG ACCTCGGCCG ATCCGGTGAC CCGCGTCGAG
GAACGGGTGC CGGTCGCCAC CGAGCGCCGT CCCCGCCGCG ACCTGCCCGC TCCCACCGGG
ATGCGTTGGG CGGCGGGCGA TCTGCACTCC CACTCCGAGC ACTCCGACGG GACGCTCAGC
CTCGACGCGC TCGCCGCGTC GGCGGCGTCC GCCGGGCTGG ACTTCCTGGC GGTCACCGAC
CACAACACCG TCAGCCACCA CCCGCACCTG GCCGCCGCCG GTGACCGGCA CGGCGTCCTG
CTGCTGCCCG GCCAGGAGGT GACCACCGAA CGCGGGCACG CCAACGCCTT CGGGCCGCTG
CCGTGGATCG ACTTCCGCCG TCCGGCCCAA CACTGGCTCG AGACCACCGA GGCCGCCGGT
GGCCTGCTGT CGGTCAACCA CCCCATCGCG GTCGACTGCG CCTGGCGCCA GCACCTGTCC
CGCCCCGCCC CGCTGGCCGA GGTGTGGCAC TGCACCTGGC GCGACCGCAC CTGGAGCGGC
CCACTGGCGT GGTGGCTGGC CAACGGCACG GCCACCACCG CGATCGGCGG CTCCGACTTC
CACGAACCCG GCCGCGACCG CCCGCTCGGC CAACCCACCA CCTGGGTCCT GGTCCCCGAC
GGCGAGCCGA CGGTGGCGGC GGTCCTGGAG GCCCTGCGAA CCGGCACGGT CGCGATCGCG
GCCGACATCG ACGGCCCGGC GCTGCTGCGC GTCGAGGACG AACTGGTCGC GGTGGCCGCC
GACGGCGCGA TCCTCAGCGA CTACTCGGGA CGACGCCGAC TGGTGCGCGG CGAGACGGCC
CGGTTCCCCG CCCCGGCGGG ACCGCACTGG CTGGAGGACT CCCGCACCAC GGTGCTGGCG
ATCGCGAACT GA
 
Protein sequence
MAAETTLTGV WTPGDRAEGF YRYLPFEVPT GTNAVRVRLE YPRAGGVLDL GCFGAEGFRG 
WSGGARDRCE IGESAATPGY LPGELETGQW NVVLGLHRVP QPLEYTVTIA TSADPVTRVE
ERVPVATERR PRRDLPAPTG MRWAAGDLHS HSEHSDGTLS LDALAASAAS AGLDFLAVTD
HNTVSHHPHL AAAGDRHGVL LLPGQEVTTE RGHANAFGPL PWIDFRRPAQ HWLETTEAAG
GLLSVNHPIA VDCAWRQHLS RPAPLAEVWH CTWRDRTWSG PLAWWLANGT ATTAIGGSDF
HEPGRDRPLG QPTTWVLVPD GEPTVAAVLE ALRTGTVAIA ADIDGPALLR VEDELVAVAA
DGAILSDYSG RRRLVRGETA RFPAPAGPHW LEDSRTTVLA IAN