Gene Sama_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1040 
Symbol 
ID4603292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1258107 
End bp1259189 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content57% 
IMG OID639780379 
Productpseudouridylate synthase 
Protein accessionYP_926917 
Protein GI119774177 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.871922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAGC TGACTTATCT TTACGGCAAG CCGGCTGTAA GCGCCGATAT CCGCACCCAC 
AACAGCGATT TTCAGGTAAA GGAAATCCTG CCGTTCCTGC CCGATGGTGA GGGCGAGCAC
CACCTGCTGC ATATCCGTAA AGATGGCCTG AACACGGCTC AGGTGGCAGA GATGATTTCC
AGATTTGCCA AGGTGCATCC CAAAGAAGTG ACCTTTGCCG GTCAAAAAGA CAAAAACGCC
ATTACAGAGC AGTGGTTTGG CGTGCGCATT CCCGGCAAGG AGACGCCAGA CTGGGCTGCC
ATGAGCAATG AGCAGATGCA GGTACTGTCG TTTGCCCGTC ATGGTAAGAA GCTGCGCACC
GGCGCGCTTT CAGGCAACCG CTTTACTTTG GTGCTGCGCA ATGTTTCAGA CCCAGAGGCG
CTGGTTGCAA GGCTCGAGCT GGTGCGCGAT GGCGGGGTGC CCAACTATTT TGGCGAGCAG
CGCTTTGGTC ATGACGGTGG CAATCTTGTT AAAGCGCGGC AGATGTTCGA AGGCCGCAAA
GTAAAAGACA GAAACAAACG CAGTCTCTAT CTCTCTGCGG TGCGCTCAGA GCTCTTTAAT
CAGGTGGCGA GTGCACGTCT GGCGCGTTTT GGCACAGCGC CCATCGAGGG TGATTGTGTG
ATGCTGTCGG GCTCACGCAG CTTTTTTACG GCCGAAAGTT GGGATGACTC CCTGAATAAA
CGTCTTGCCG AGCAGGACAT TCAGCTGTCG GCGCCACTTT GGGGCCGTGG TGAGCAGCTC
GCCAAAGGCG ACGCCGCAGC ACTTGAATCT TCGGTGCTGG CAGCCTACGA GCTTGAGCGC
AATGGTCTCG AAAAAGAAGG CCTTACCCAG GAGCGCCGAG CACTGCTGCT GCGTCCTGAG
CAGCTGAGCT TCAAGCTGGA TGGCGATGCC CTGACCCTGG ATTTCTGCCT GCCAGCGGGT
GCCTTTGCCA CCAGTGTGCT CAGAGAGCTT TGCGAGTACA CAGATATAAA AGAGCTGGAG
TGGCGCCGGG CGGTGGCCGA GCGTGATGCG CGGGAGGGCG ACAATGCGCC TGCTGGTGAG
TAA
 
Protein sequence
MTELTYLYGK PAVSADIRTH NSDFQVKEIL PFLPDGEGEH HLLHIRKDGL NTAQVAEMIS 
RFAKVHPKEV TFAGQKDKNA ITEQWFGVRI PGKETPDWAA MSNEQMQVLS FARHGKKLRT
GALSGNRFTL VLRNVSDPEA LVARLELVRD GGVPNYFGEQ RFGHDGGNLV KARQMFEGRK
VKDRNKRSLY LSAVRSELFN QVASARLARF GTAPIEGDCV MLSGSRSFFT AESWDDSLNK
RLAEQDIQLS APLWGRGEQL AKGDAAALES SVLAAYELER NGLEKEGLTQ ERRALLLRPE
QLSFKLDGDA LTLDFCLPAG AFATSVLREL CEYTDIKELE WRRAVAERDA REGDNAPAGE