Gene Snas_4550 details


Gene Information

Locus tag: Snas_4550
Symbol:
ID: 8885755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4851814
End bp: 4853073
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: CMP/dCMP deaminase zinc-binding protein
Protein accession: YP_003513287
Protein GI: 291302009
COG category:
COG ID:
TIGRFAM ID:
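
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(4851814, 4853073)
    print(length)                  # 1260
    print(protein_length(length))  # 419
```

Both values agree with the Gene Length and Protein Length fields listed above.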


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.757162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG AACACTGGCT GCGCCAGGCC ATCGCGCTGG CCGCGAAATG CCCGCCCTCG 
GACACCGCCT TCAGCGTCGG CGCGGTCATC GTCGCCGACG GCCGGGTGCT GGCCACCGGC
TACTCGCGCG AGACCGACCC GCACGACCAC GCCGAGGAGG CCGCGCTGTC CAAACTCGAC
CAGGATCTGA GCGGGGCGAC CGTCTACAGT TCGCTGGAAC CGTGCGGTCA GCGTGCCTCG
CGTCCGGTCA GCTGCGCCGA GCTCATCATC GCCGCCCGCG TTCCCCGGGT GGTGTACGCG
TGGCGCGAAC CCGACACCTT CGTCCAGCCC AAGGGCCTGC GGCTGCTGGC CGAAGCCAGG
GTCGAACTGG TCCAGCTGTC GCACCTGGCC GAAGCGGCCG CCGCGCCGAA CGCGCACCTG
CTGTTATCCG CAACAATGTC ACCCATGCCG TTGATGACCG CCGGACAGCT GCGCGAACTG
CTGGAAACCC AGGCACCGAC CGTGCTCGAC GTCAGGTGGA GCCTGGCCGC GGGCGCCGAC
CGGGACGGTT ACCAGGCCGG ACACCTGCCC GGCGCCGTGT TCCTCGACCT GGACGCCGAC
CTGTGCGGCC CACCCGGACC CGGCGGACGG CACCCGCTGC CCGAACCCGA CGCACTGCGG
GCGGTGCTGC GCTCGGCGGG CGTGACCCGC ACCGGCCCGG TCGTCGTCTA CGACGGCGGC
GACATGCTGG CCGCCGCCCG CACCTGGTGG ACGCTGCGCT GGGCCGGGGT GCAGGACGTG
CGGGTCCTGG ACGGCGGCTT CGCCGCCTGG CAGGCCGAAG GCGGCCCGGT CACCGCCGAG
ACCTCCGACG TCGCGGCCTC CGACTTCGAC GTCACCCCCG GCGGGCTGCC CGAACTCGAC
AGCGCCGCGG CCGCCGCACT GGCCCGCACC GGCACCCTGC TGGACGTGCG CACCCCCGAG
CGGTACCGGG GCGAGACCGA ACCCATCGAC CCGGTCGCCG GGCACATCCC CGCCGCCGTC
AACGCGCCCG CGGGCGACAC GATGGCGCCC GACCACGGCT TCCGCGCGCC CGGCGAACTC
GCCGAGACCT ACCGCCGCTT CGACGGCGAG GTCGGCGTGT ACTGCGGCTC CGGCGTCACC
GCCGCCCGCA CCGCCCTGGC CATGACGGCC GCGGGCCTGG ACACCCCGGC GGTGTACATC
GGTTCCTGGA GCCACTGGGT CGCCGACCCG GCCCGGCCGG TCGCCCTCGG TGATGAATGA
 
Protein sequence
MADEHWLRQA IALAAKCPPS DTAFSVGAVI VADGRVLATG YSRETDPHDH AEEAALSKLD 
QDLSGATVYS SLEPCGQRAS RPVSCAELII AARVPRVVYA WREPDTFVQP KGLRLLAEAR
VELVQLSHLA EAAAAPNAHL LLSATMSPMP LMTAGQLREL LETQAPTVLD VRWSLAAGAD
RDGYQAGHLP GAVFLDLDAD LCGPPGPGGR HPLPEPDALR AVLRSAGVTR TGPVVVYDGG
DMLAAARTWW TLRWAGVQDV RVLDGGFAAW QAEGGPVTAE TSDVAASDFD VTPGGLPELD
SAAAAALART GTLLDVRTPE RYRGETEPID PVAGHIPAAV NAPAGDTMAP DHGFRAPGEL
AETYRRFDGE VGVYCGSGVT AARTALAMTA AGLDTPAVYI GSWSHWVADP ARPVALGDE
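
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the sequence is pasted in the space-separated blocks of ten shown in this record, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the tables differ in which codons may serve as starts; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and written only for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGCTGACG AACACTGGCT GCGCCAGGCC"))  # MADEHWLRQA
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.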