Gene Snas_5001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5001 
Symbol 
ID8886208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5307187 
End bp5308467 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003513731 
Protein GI291302453 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.52714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.539309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGC CGGATCTGAA ACGACTCGCC GACACGTGCG TCCATTGTGG TTTCTGTTTG 
TCGACGTGCC CCACCTACGA ACTGTGGGGA CAGGAGATGG ACTCGCCGCG CGGGCGGATC
CAGCTGATGA AACTGGGTCT GGAGGGCGCC GAGCTCACCG ATTCCACCGT GAACCACATG
GACCGGTGTC TGGGGTGCAT GGCCTGCGTG ACGGCCTGTC CCTCGGGGGT TCGCTACGAC
GTCCTCATCA CCGCCAAACG CGCCGAGGTG GAGGAACAGC ATCCGCGCAC CGCTTCGGAG
CGCTGGCTGC GGCGGCTGAT CTTCGCGCTG TTCCCGTATC CGCGCCGGTT GCGGCTGTTG
CGGTGGCCGC TGCGGATCGC GCAGTGGCTG CGGTTGGACC GGCTCGCGAC CCGGACGCTG
TCGCGGCGGG CCCCGCGACT GGCGACGATG GCGACACTGG CGCCCCGCGC GGGTGCGCGT
CCCCGACTGC CGCAGCGGAT CGCGGCGTCG GGCGACAAAC GGGCGACGGT CGGCATGCTC
ACCGGCTGCG TCCAGGGCGA GTTCTTCCCG CAGGTCAACG CCGCCACCGC GCGGGTGCTG
GCGGCCGAGG GCTGCGAGGT GGTGATCCCA CCGGGGCAGG GCTGCTGCGG GGCGTTGTCC
CTGCACACTG GACGGCGAGC CGAGGCGACG AACTTCGCCA AGGCCACGAT CGAGGCGTTC
GAGGCCGCCG GGGTCGACAC GATCGTCGTC AACGCGGCCG GTTGCGGCTC GGCGATGAAG
GAGTACGACG AACTGTTCGC CGACGACCCC AACTGGGAAC GACGGGCCCG CGACTTCGTC
GCGAAGGTCC GCGACGTCAG CCAGTACCTG GCCGAACTGG GGCCGCGCGG GCCACGACAC
GCACTGAACC TCACCGTCGC CTACCACGAC GCCTGCCATC TGGCCCACGC CCAACGGGTG
CGGGCACAAC CGCGCGAACT GTTGCGCGGC ATACCCGGGC TCGACGTACG CGAGATCGCC
GACGCCGAGA TCTGCTGCGG TTCGGCCGGT GTCTACAACA TCCTGCAACC GAAAGCCGCG
TCCGAACTGG GCGACCGCAA GGCCGCCAAC GTCCTGGACA CGAACGCGGA ACTGCTGGTC
TCGGCCAATC CCGGCTGCGC CATGCAGATC GCCGCCGCCG TCACACGACG CGGCGAGTCA
CTGCCGGTGG CGCACATCGT CGAAGTCCTC GACGCGGCGA TCCGTGGCGA CGATCCCGCG
AAACTGCTCG ATCGAGGCTA A
 
Protein sequence
MDAPDLKRLA DTCVHCGFCL STCPTYELWG QEMDSPRGRI QLMKLGLEGA ELTDSTVNHM 
DRCLGCMACV TACPSGVRYD VLITAKRAEV EEQHPRTASE RWLRRLIFAL FPYPRRLRLL
RWPLRIAQWL RLDRLATRTL SRRAPRLATM ATLAPRAGAR PRLPQRIAAS GDKRATVGML
TGCVQGEFFP QVNAATARVL AAEGCEVVIP PGQGCCGALS LHTGRRAEAT NFAKATIEAF
EAAGVDTIVV NAAGCGSAMK EYDELFADDP NWERRARDFV AKVRDVSQYL AELGPRGPRH
ALNLTVAYHD ACHLAHAQRV RAQPRELLRG IPGLDVREIA DAEICCGSAG VYNILQPKAA
SELGDRKAAN VLDTNAELLV SANPGCAMQI AAAVTRRGES LPVAHIVEVL DAAIRGDDPA
KLLDRG