Gene Snas_5801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5801 
Symbol 
ID8887017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6162841 
End bp6164028 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content66% 
IMG OID 
Productcytochrome P450 
Protein accessionYP_003514524 
Protein GI291303246 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0291832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.365285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACG AACCCTTCAA CCTCGTGATG TTTCAGCGCG ACGGTCTTGA TCCCGTGCCT 
GAACTGGCGC GCCGCCGTGC CGAAAACCCG GTGAGCAGGG TGCAATACCC GATCGGCCCG
CCGATTTGGC TGGTGACCGG CTACGAGGAC ACCCGTACCG TGCTCGGGTC GAACAAGTTC
AGCAATGACT TCGCCAAGAT GACGGCTGAA GACGACCTCG CCTTCCTCAA GGACGTCAAC
CCGGGTGGCC TGGGATTCAA GGATCCGCCC GACCACACCC GGCTGCGCAA GATGCTCACA
CCCGAGTTCA CGATGCGGCG GCTGCGGCGG CTGATCCCGC GTATCGAGGA GATCGTCGCC
GAACGCCTGG ACGCGATGGA GGCCGCCGGG GACGGCGTCG ACCTGGTCGA CGCGTTCGCG
GTGCCGATCC CCTCCCTGGT GATCAGCGAA CTGCTCGGTG TCCCGTACCC GGACCGCGCC
GACTTCCAGC GGCTGTCGGA GTCCCGTTTC GACTTCCTGG GCGACATCGA GGGCTGCCTG
GCCGCCGTTC AGGACACTTT GGAGTACCTG TCCGGCCTGG TGGCGCAACA GCGCGCCGAA
CCGGGGGACA ACCTGCTGGG CATGCTGGTG CGCGAACACG GCGACAACAT CTCCGACGCC
GAACTCACCG AGATCGCCGA CGGCATCCTC ATCGGCGGCC ACGAGACCAC CGCGAGCATG
CTGGCACTGG GCGCCCTGCA CCTGATGACC AAACCCGAGC ACTTCGCGAT GGTCCGCGAC
GACGACGACA AGGTCGTCCC GGTCGTCGAC GAACTGCTGC GCTACCTGAC CGTCGTGCAG
GTGGCCTTCC CGCGGTTCGC GCTGGAGGAC GTGAAACTGT CCAACGGCCA GGTCGTCCGG
AAGGGCGAGG TCGTGCTGGC CTCGCTGTCG GGCGCCAACC GCGACTCCGC CTTCGGCGCG
GACGCCGAGA AGGTCAACAT CTTCCGCGAC ATGCCGCCGC ACGTGGCCTT CGGCTACGGA
CTGCACCGCT GCGTCGGTGC CGAACTGGGC CGCATCGAAC TCCAGATCGC CTACCCGGCG
CTGCTGCGCC GGTTCCCGAA CCTGCGGCTG GCGGTGCCGT TCGAGGAACT GAAGTTCCGC
GAACTGTCCA TCGTGTACGG AGTCGAGAAG CTGCCGGTGA ACCTGTGA
 
Protein sequence
MSDEPFNLVM FQRDGLDPVP ELARRRAENP VSRVQYPIGP PIWLVTGYED TRTVLGSNKF 
SNDFAKMTAE DDLAFLKDVN PGGLGFKDPP DHTRLRKMLT PEFTMRRLRR LIPRIEEIVA
ERLDAMEAAG DGVDLVDAFA VPIPSLVISE LLGVPYPDRA DFQRLSESRF DFLGDIEGCL
AAVQDTLEYL SGLVAQQRAE PGDNLLGMLV REHGDNISDA ELTEIADGIL IGGHETTASM
LALGALHLMT KPEHFAMVRD DDDKVVPVVD ELLRYLTVVQ VAFPRFALED VKLSNGQVVR
KGEVVLASLS GANRDSAFGA DAEKVNIFRD MPPHVAFGYG LHRCVGAELG RIELQIAYPA
LLRRFPNLRL AVPFEELKFR ELSIVYGVEK LPVNL