Gene Snas_2254 details


Gene Information

Locus tag: Snas_2254
Symbol:
ID: 8883448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 2388767
End bp: 2390227
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: TAP domain-containing protein
Protein accession: YP_003511036
Protein GI: 291299758
COG category:
COG ID:
TIGRFAM ID:
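
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2388767, 2390227)
print(length)                  # 1461
print(protein_length(length))  # 486
```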


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.267716
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAACC CGTTCACGAA ACGGATCAGC CTGGCCGCGA CGGCTGTGGT CGTCGCCGCC 
GTGGCCGCCG TCTACTTCGT CTCCGGTGCC GAAGGGGCGA GCGTGTCCTC GCTCGACTGG
AAACCCTGCC AGGACAACGA CAAGGCCGAC TGCGCCAGCG TCACGGTCCC CATCGACTGG
TCGAAGCGAC AGTCCGGCAC AGTGGACGTC GCCGTCGCCC GCCGTGAGGC CACCGACCCG
AAGCACCGCA TCGGAACGCT GGTCTACGTA CCGGCCGGAC CGGGCAGCTC CGGCGTCGAG
GCGATCACCG ACGACGAGTT CTTCACAATG CTCATCACCT CACCGATCGC CGAACGGTTC
GACGTGATCG GCCTCGACCC GCGCGGCGTC AAACGCAGCC ACCCGGTGAC CTGCGCCAAG
GACCTCGTCG ACAAGGTCGA CGCCGTCGTG CCGCCGACCC GCGAGGGCTA CGACGCCCGC
CTGAAGGGCA ACGCGGCGCT GATGGACGAC TGCCGCGACC GCACCGGGCC GCTGTTCGAC
CACCTCGACA GCGTCAGTGT CGCCAAGGAC ATCGAGGCGC TGCGCTCCGC ACTCGGCGAG
GAGTCGGTCA ACCTGTACGC ACTGTCCTAC GGCACCCTGC CCGCGCAGAT GTACGCCGAA
CAGTTCCCCG ACCGGTTGCG CGGCACGGTC CTGGACAGCA ACATGGACCA CAGTCTGAAC
ACCAAGGACG TCATGGTGAC CGGCGCCGAG GCCACACAGG ACGCCTTCGA TGACCTGGTG
TCCTGGTGCC GGGACGACGA GGAATGTTCG CTGCATGGCA AGGACGTCCG CGCCATGCTC
GCCGACCTGT ACGCCAAGGC CGAAGCCGGG GACCTCACCG AACCCGGCGA CGCGGACACC
CCGGTCACCG TCGCGAACCT GATCAGCGGC ATCGTCTCAC CGCTCGCGGT GACTGACCGT
GGCAAGGCCG CCGACCGGAT CGCCGCGTTG ACGACCGGCA AGGGCAAGGC CGCACCGACC
GAGCAGGCCG ACGGCGAGGT GATGCCGCTT CCCATCACCA TGCAGTGCGC CGACTTCCCG
CGCGGCATCG CCGACTACGA CGCGTTCCGC GACGCCTGGG ACGCGTCCGA GAAGGCCGCC
CCCGACGTCC ACTTCTCACC GTTGAACTGG TCGGTCCCGC AGAACTGCCT GAACTGGGAC
GCGGGCGCCG ACAACCCGAG GCACCGGCTC GACGTGCGGG GCGCCGAACC CGTGCTGGTG
TTGGGATCGC GCTTCGACAC CCAGACGCCG TACTCCTGGA GCGCCAACGT GGCCTCCCAG
ATCGACGGCG CGGTCCTGGC CACCTATGAG GGCCCCGGAC ACGGTGTCTA CCAGCGCACC
GACTGCACCC GGCGGCTCGT CGAGCGCTAC CTGATCCGGG GCGAGACGCC ACGCGACGGG
GTCAGCTGCC CCGCCGCGTA G
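
The gene sequence is printed in space-separated blocks of 10 bases. A minimal sketch for joining the blocks and recomputing the GC fraction, so that the result can be compared with the GC content listed under Gene Information, might look like this. The helper names are hypothetical, and only the first line of the sequence is used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste all lines to check the whole gene.
first_line = "ATGAGGAACC CGTTCACGAA ACGGATCAGC CTGGCCGCGA CGGCTGTGGT CGTCGCCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_content(seq)))
```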
 
Protein sequence
MRNPFTKRIS LAATAVVVAA VAAVYFVSGA EGASVSSLDW KPCQDNDKAD CASVTVPIDW 
SKRQSGTVDV AVARREATDP KHRIGTLVYV PAGPGSSGVE AITDDEFFTM LITSPIAERF
DVIGLDPRGV KRSHPVTCAK DLVDKVDAVV PPTREGYDAR LKGNAALMDD CRDRTGPLFD
HLDSVSVAKD IEALRSALGE ESVNLYALSY GTLPAQMYAE QFPDRLRGTV LDSNMDHSLN
TKDVMVTGAE ATQDAFDDLV SWCRDDEECS LHGKDVRAML ADLYAKAEAG DLTEPGDADT
PVTVANLISG IVSPLAVTDR GKAADRIAAL TTGKGKAAPT EQADGEVMPL PITMQCADFP
RGIADYDAFR DAWDASEKAA PDVHFSPLNW SVPQNCLNWD AGADNPRHRL DVRGAEPVLV
LGSRFDTQTP YSWSANVASQ IDGAVLATYE GPGHGVYQRT DCTRRLVERY LIRGETPRDG
VSCPAA
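
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 additionally permits alternative start codons, which this sketch does not handle and which are not needed here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGAGGAACC CGTTCACGAA ACGGATCAGC"))  # MRNPFTKRIS
print(translate("GTCAGCTGCC CCGCCGCGTA G"))           # VSCPAA
```

Both outputs match the first and last blocks of the protein sequence listed above.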