Gene Snas_5458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5458 
Symbol 
ID8886669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5789482 
End bp5790702 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content73% 
IMG OID 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_003514183 
Protein GI291302905 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTTCCG CTCTGCCCGA CGGTGAAGCC GCTCCCGACG ACGGAGCGCT GCCCGACTCT 
GCCCTGAAAT CCGCCCGGAG CCACGATTTC GGGATCTACG TGCACATTCC GTTCTGCGCC
TCCCGGTGCG GCTACTGCGA TTTCAACACC TACACGGCGG CCGAACTCGG TTCGGGCGTC
AGCCGCGAGT CCTTCCCGCG GTTGCTGGAG GCCGAGATCG ATCTGGCCGC AAGGGTTCTG
GGCGACGTGC CGAGGCCGAT CTCGACGATC TTCTTCGGCG GCGGCACCCC GAGCCTGCTG
CCCGCGTCGG AGCTGACCGG GATCGTGGCG AAACTGGAGT CGACCTTCGG GCTGACGCCG
GACGCCGAGA TCACCACCGA GGCCAACCCC GAGTCGGTGG ACCCCGAGTA CTTCCGCCAA
CTGCGCGCGG GCGGGTTCAC CCGAGTGTCG CTGGGCATGC AGTCCACCGC GCGGCGGGTG
CTGGCGGTGC TGGAGCGGGG CCACACTCCC GGCCGGGCCC TGGACGCCGC GCGCGAGGCC
CGCGAGGCCG GGTTCGAGCA CGTCAACCTG GACCTCATCT ACGGCACCCC CGGCGAGACC
GCCGAGGACT TCGAGGCCAG CCTGCGCGCG GCCGTCGACA CCGGCGCCGA CCACGTCTCG
GCCTATTCGC TGATCGTCGA GGACGGCACC CGGCTGGCCG GGCAGGTGCG GCGCGGCGAC
ATCCCCGCGC CCAGCGACGA CGAGGCCGCC GACCGGTACC TGGCGGCCGA ACGCGTCCTC
GGCGAGGCCG GATTCGAGTG GTACGAGGTG TCCAACTGGG CCCGCTCCGA GGCCGCGCGC
TGCCGCCACA ACCTGCTGTA CTGGCGCGGC GGCGACTGGT GGGGACTGGG GCCGGGAGCC
CACAGCCACG TCGGCGGCGT GCGCTGGTGG AACGTCAAAC ACCCGGCCCG CTACGGGCAG
CGCCTGACCG ACGGACTGTC CCCGGCGCAG GGCCGTGAAC TGCTCACCCC GCCGCAGCGG
CACATGGAGG ACGTCCTGCT GGGCGTGCGG CTGGCCGACG GCCTGGCACT GTGCGCGCTG
GACGCCGACG GACAGCGCAA CGCCCGCAAG GCCGCGGCGG AGGGACTGCT GGATCCGGCG
GCGCTGGCCG CCGATCGCGT CCGGCTCACG CTGCGGGGCC GGCTGCTGGC CGACGCGGTC
GTGCGCGACC TGGTGCCCTA G
 
Protein sequence
MPSALPDGEA APDDGALPDS ALKSARSHDF GIYVHIPFCA SRCGYCDFNT YTAAELGSGV 
SRESFPRLLE AEIDLAARVL GDVPRPISTI FFGGGTPSLL PASELTGIVA KLESTFGLTP
DAEITTEANP ESVDPEYFRQ LRAGGFTRVS LGMQSTARRV LAVLERGHTP GRALDAAREA
REAGFEHVNL DLIYGTPGET AEDFEASLRA AVDTGADHVS AYSLIVEDGT RLAGQVRRGD
IPAPSDDEAA DRYLAAERVL GEAGFEWYEV SNWARSEAAR CRHNLLYWRG GDWWGLGPGA
HSHVGGVRWW NVKHPARYGQ RLTDGLSPAQ GRELLTPPQR HMEDVLLGVR LADGLALCAL
DADGQRNARK AAAEGLLDPA ALAADRVRLT LRGRLLADAV VRDLVP