Gene Snas_3154 details

Gene Information

Locus tag: Snas_3154
Symbol:
ID: 8884353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3332936
End bp: 3334021
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: dihydroorotate dehydrogenase
Protein accession: YP_003511918
Protein GI: 291300640
COG category:
COG ID:
TIGRFAM ID:
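
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(3332936, 3334021)
print(length)                     # 1086
print(protein_length_aa(length))  # 361
```

Both results match the Gene Length and Protein Length fields of this record.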


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.351968
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000224626
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCTGTACA AACACCTGGT CCGCCCGATC CTGTACCGGA TGGGACGTGG CGACCCCGAG 
GTCGCCCACG AGAAGACCCT CGGGATGCTG ACGGGCGCCA AGCCGTGGAT GCTGTCCGCG
CTGGAGAAGG TCAACAAGGT GTCCGACCCG CGCCGGGTCT TCGGCATCGA CTTCCCCAAC
GCGGTGGGCC TGGCCGCGGG CATGGACAAG AACGGCGTCG CGGTGGCGGC CTGGCCCGCG
CTGGGTTTCG GTTTCGTGGA GCTGGGCACC GTCACCTGGT ACGGCCAGCC GGGCAACCCC
CGGCCCCGGC TGTACCGGCT GCCCGACTCG CAGGCCCTCA TCAACCGGAT GGGCTTCAAC
AACGAGGGCG CCCAGGCCCT GGCCGCCCGG CTGGGACAGC CACCCCGGCC GGCGGTGCCG
GTGGGCATCA GCCTCGGCAA GTCGGCCACC ACGTCGCTGG AGCACGCGGT CGACGACTAC
GTCGCGTCCT TCCACGCCCT GTACCGCTAC GGCGACTATT TCGCCGTCAA CGTCTCCTCG
CCCAATACCC CGGGCCTGCG CACCCTTCAG GACGCGGAGC AGCTGTCGCG CATCCTCAGG
GCCCTTTACG AGGAGGGTGC GCGGTTGTCG GGCGGTGCCC GTCCCAAGCC GATCCTGGTG
AAGCTGGCCC CCGACCTGAC GGAACCGGCG ATCTTCCAGG CGCTGGAGGT GTGCATGACC
CACGGCGTCT CGGGTGTCAT CGCCGCCAAC ACCACCCTGG GCCGCGACAA GGTCGCCAAG
ACCGACACCG ACCGGGCCGC GCAGCCAGGT GGCCTGTCGG GCGCGCCCCT GCGCGACATC
ACCCGCTCCA TCGTGTCGTT CATTCACCGG GAAACCGCCG GACGACTGCC GATCATCGGC
GTCGGCGGCA TCACCTCGGC CGATGACGCG GTGCGGTTGG TCGACGCCGG TGCCAGTCTT
GTGCAGCTGT ACACGGGTCT GGTCTACTCG GGCCCGGCTC TGGTACGCAA ATCCGCGCGC
GCGATCCGCA AGGCCCCGCC CGCGCGTCCC CCCATCCGAC CTCAAGGAGG AGCCCGACGT
GGGTGA
 
Protein sequence
MLYKHLVRPI LYRMGRGDPE VAHEKTLGML TGAKPWMLSA LEKVNKVSDP RRVFGIDFPN 
AVGLAAGMDK NGVAVAAWPA LGFGFVELGT VTWYGQPGNP RPRLYRLPDS QALINRMGFN
NEGAQALAAR LGQPPRPAVP VGISLGKSAT TSLEHAVDDY VASFHALYRY GDYFAVNVSS
PNTPGLRTLQ DAEQLSRILR ALYEEGARLS GGARPKPILV KLAPDLTEPA IFQALEVCMT
HGVSGVIAAN TTLGRDKVAK TDTDRAAQPG GLSGAPLRDI TRSIVSFIHR ETAGRLPIIG
VGGITSADDA VRLVDAGASL VQLYTGLVYS GPALVRKSAR AIRKAPPARP PIRPQGGARR
G
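
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs and newlines alike.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above.
sample = "ATGCTGTACA AACACCTGGT"
seq = clean_sequence(sample)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.45
```

Applied to the complete gene sequence, the same two functions can be used to check the reported length of 1086 bp and to compare against the GC content field.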