Gene Snas_5739 details


Gene Information

Locus tag: Snas_5739
Symbol:
ID: 8886955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 6101562
End bp: 6103613
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: YhgE/Pip N-terminal domain-containing protein
Protein accession: YP_003514462
Protein GI: 291303184
COG category:
COG ID:
TIGRFAM ID:
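
The gene and protein lengths listed above can be checked against the start and end coordinates. The following is a minimal sketch, not the database's own method. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention, but both reproduce the listed values. The variable names are illustrative.

```python
# Coordinates copied from the record above.
start_bp = 6101562
end_bp = 6103613

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2052
print(protein_length)  # 683
```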


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.979212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAGC CGTTCGCCGA TTCGGCCCGC GCCATCGCTT CCGGGCCGCT GACCTGGAAG 
ACCTGGACCG GGCTCATCAC GGTTCCGGTC CTCATCCTGT CGCTGTTGAC GTGGGCGTAC
TGGTCGCCCG GTGCTGATCA CGGCACCGCC ACCGCGGCCA TCGTCAACGA CGACAAGCCC
GTCAAGGTCA ACGGACAGAC CATCCCCTTG GGACGGCAGC TGGCCGGAAT CCTCAGCCAC
AGCGAGGACT CCGCCTATCG GTGGGTTCTC ACCGACGCCG ACGACGCCGA GGCCGGGCTG
GCCGACGGCG GTTACGCCGC CACCGTCACC ATCCCGGAGG ACTTCTCGCG GCGGGCCACC
TCCACCGCGA CCCAGCCGCC GCTCGACGCC GCCCGGGCCA CGCTGCGGGT CCGCACCTCC
GACGCCACCG GCGCCGCCGA CCCGCGACTG GTCACCGCCA TCGCGACGGC AACCCAGGAC
TCCCTCGACA ACCAGATCGT CGAGACCTAT CTGGACAACA TCTACGTCGC GTTCACCACC
ATCCACGACA AACTCGGCGA AGCCGCCGAC GGCGCCAGCC AGCTCGCCGA CGGAACCTCG
CAGCTGTCCG ACGGCGCCGG GGAACTGGCC GACGGGGCCG CGCAGCTCGA CACCGCCACG
GCCGAGCTGT CGCGCGGTGC CGACCAGCTC GCGTCGGGTA CCGGCGAACT CGCCGACGGC
TCAGCGCAAC TCTCCAGTGG ACTGTCGCAG GCCGAGCGGG ACACCGCGCA GTTGCCCCGG
CTGACCCAGC GGCTGGCTGA CGGCGCCGAG CAGGTCGCGC AGGGAAACGA ACAGCTCGCC
TCGGTCGTGG TCCCGCTGGC CAACCGCATC ATCGACGCCA TCGACGCGCT GCCGTCCGCG
CGCGAGGCGG CCCGGGAGTT CCGGCGGCTC GCCGATGACT GCGCCGCCGG GGGCGGCAGC
CCGGGATTCT GCCGGGGACT GGACCGGGCC GCTGACCGCT TCGAGACCAA CGCCGGAAAA
CTGGACGGTG CCAGGGAAAC CATCCGGCAG GCCGCCGTCG ACGCCCGCGA CGCGGTGCGG
GACCTGGCGT CGGGGGCTCG GCAGGTCGCC GACGGCAACG CCACCCTCGC CTCCCGGGCC
GGGGAACTGG CCGGGGGTAT CGCCTCGGCC GCCGACGGCG CCCGGCAACT CGACTCCGGC
ATCCAGCAGG CGGACACCGG GGCCCGGCAG CTCGCCACCG GAGCCGGACG ACTGTCCACA
GGAGCCGGTA ATCTGTCCAC AGGGGCGGAC CAGCTTAGCG ATGGCGCCAA GGAGGTCGAC
GACGGTGCCC GCGAGCTGGC GTCCGGCCTG AACCAGGTCC GCGACCAGGT CCCCAGCTAC
ACCAAGGACG AACGCGCCCA CCTCAAGACC GTGGCCGCCG ATCCCACCGA CGCCGACGCC
TCCCGCACCG GCATCGGGCC GCTGGCGTTG ACGCTGTTCG TCGCCATGGC ACTGTGGGCA
CTCGCGTTGG CCACCTACAT CGTGACCCGG GCCGTTCCCG CCGCCGTCCT GACCGCCCGT
GCCGCCACCT GGCGGATCAT CCTGCGCACC GCGGTTCCCG GCTCCTGCGC CGCCGTGTTG
GCCGCGCTCG CGCTCACCGT GATCGCGGTT CCGGTGCTGG GACTGGGAAT CATGCGGGCC
CTGGGCTTCG GCGCCGTCGC CCTGCTGGCC GCACTGACCT TTGTGTCCCT CAACCAGGCC
GCCGTGGCGA TCTTCGGCCG CGCCGGACGA CTGGCGTCGC TGACCGTCCT GCTGCTCACC
GCCGCCACCG GCGTCATCTC CACGCTGCCG TCCCCTTTGT ACGCCGTCGC GGGCTACCTG
CCCACCCACG CCGCGACACT CGCCTTGCGC GGCACCGTGA CCGACACCCA CGCGCTGATG
CTCACCGGAA CCGTCCAGCT CGCCGCCTGG CTGGCCGTCG GGACACTGGC GACCATCGTC
ATCACCGACC GCCGCCGCTA CCTGTCCACG CGACAGTTGC GCCGTGGCAC CGTTCTGCCC
GCCGCGACCT GA
 
Protein sequence
MFKPFADSAR AIASGPLTWK TWTGLITVPV LILSLLTWAY WSPGADHGTA TAAIVNDDKP 
VKVNGQTIPL GRQLAGILSH SEDSAYRWVL TDADDAEAGL ADGGYAATVT IPEDFSRRAT
STATQPPLDA ARATLRVRTS DATGAADPRL VTAIATATQD SLDNQIVETY LDNIYVAFTT
IHDKLGEAAD GASQLADGTS QLSDGAGELA DGAAQLDTAT AELSRGADQL ASGTGELADG
SAQLSSGLSQ AERDTAQLPR LTQRLADGAE QVAQGNEQLA SVVVPLANRI IDAIDALPSA
REAAREFRRL ADDCAAGGGS PGFCRGLDRA ADRFETNAGK LDGARETIRQ AAVDARDAVR
DLASGARQVA DGNATLASRA GELAGGIASA ADGARQLDSG IQQADTGARQ LATGAGRLST
GAGNLSTGAD QLSDGAKEVD DGARELASGL NQVRDQVPSY TKDERAHLKT VAADPTDADA
SRTGIGPLAL TLFVAMALWA LALATYIVTR AVPAAVLTAR AATWRIILRT AVPGSCAAVL
AALALTVIAV PVLGLGIMRA LGFGAVALLA ALTFVSLNQA AVAIFGRAGR LASLTVLLLT
AATGVISTLP SPLYAVAGYL PTHAATLALR GTVTDTHALM LTGTVQLAAW LAVGTLATIV
ITDRRRYLST RQLRRGTVLP AAT
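
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons. The sketch below is a hedged example, not the pipeline that produced this record. It builds the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons it permits), and translates the first and last blocks of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Codon assignments in the conventional NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and final 12 bases of the gene sequence listed above.
first_block = "ATGTTCAAGC CGTTCGCCGA TTCGGCCCGC"
last_block = "GCCGCGACCT GA"

print(translate(first_block))  # MFKPFADSAR
print(translate(last_block))   # AAT*
```

The two outputs match the first ten and last three residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon that ends the gene.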