Gene Shewmr7_3023 details

Gene Information

Locus tag: Shewmr7_3023
Symbol:
ID: 4257573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-7
Kingdom: Bacteria
Replicon accession: NC_008322
Strand:
Start bp: 3583980
End bp: 3585503
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 43%
IMG OID: 638123701
Product: tryptophan halogenase
Protein accession: YP_739064
Protein GI: 114048514
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
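
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3583980, 3585503)
print(length)                     # 1524
print(protein_length_aa(length))  # 507
```

Both values agree with the Gene Length and Protein Length fields in the record.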


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.172651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCA GAAAAATAGC CATTATAGGG GGCGGGACTT CAGGTTGGTT AGCGGCGAAT 
CATTTAGGGC GAGTGTTGCA GGGGCGCTCT GAGCTATCTA TCACGCTGAT TGAATCTCCT
GATATTCCTA TCATCGGAGT TGGTGAAGGT ACTGTACCGT CCATTCGGAA ATCACTACAG
AGTTTTGGGA TCAGTGAATC AGAGTTTATT CGCTCATGCG ATGTGACATT TAAGCAGTCA
ATTAAATTTG TCAATTGGTT AGATAAAGCC CGACATGGTA AGGATAATTT CTACCACCAT
CTATTTGATA TCATCAATTT ACAAGGTGTA GACACTGTTT CTGCTTGGTT GGGCGAGAAA
CAAGGCGATT TTGCTGATTA TGTTTCTCCC CAGCATCTTG TTTGTGAGGC CGCAAAAGCG
CCAAAGCTAA TCACGACACC TGAATATGCT GGAGTGTTAG GCTATGCCTA TCATTTAAAT
GCGGCAAAAT TTGCCAAATT ATTGGCTAAA AATGCGATTG AGAAATTCAA CGTTGAGCAC
ATTTTTGCCA CTGTGCAGGA TGTTCGTTTA GGCGATGATG GTGCAATTTC ATCTTTGCTG
ACTGAGCAAG GGACTCTCTC TTTCGATTTC TATATTGATT GCAGTGGCTT TGAATCGATT
TTGTTAGCTA AGGCACTAAA AGTGCCATTT ATCAGTAAAG CGCATCAACT GTTTATTGAT
ACTGCGTTAG TCGCGCAAAT TCCAACGCAA CCTACGGATA TCATCCCTCC TTATACTCAA
GCAACGGCCC ATCGAGCAGG TTGGATATGG GATATCGCAT TGACCCAGCG CCGAGGCACG
GGATTTGTCT ATTCGTCTAC TCATATGGCG CAGTCAGAGG CAGAACGAAA GTTTGACCAT
TATCTCGGTG GCAAACTTGC CGATGTTCCG CATCGTAAAA TTCCAATGAC AGTGGGCCAT
CGTCAACAGT TTTGGGTAAA AAACTGTGTA GCTTTAGGGT TAGCTCAAGG CTTTTTAGAG
CCGATTGAAG CGACATCCAT ATTATTAACG GATTTCTCTG CACGTTTTCT GGCAGAACGT
TTTCCAGTGC ATACAGATGA TGTTGATTAC TTAGCAAAGC GGTTTAATGA TACCGTGGGC
TACGCATGGG AGCGGGTCGT TGAATTTGCC AAACTACATT ACTGTTTATC GGATAGAACT
GACTCATCAT TTTGGCTCGA TAATCAGGCT TCAGAGACTA TTCCCGAAGG ATTAAAGCAA
CGACTCGAGC TATGGAAATC CTATGGTCCT ATTGCTGAAG ATTTTCCATC TAAGTTTGAA
GTATTTAACT TAGACAATTA TCTATACGTT CTTTATGGCA TGAAATATCA GACTAATTTG
CGTGGCGGCT CATCGACAAC ACAGGCCTCT TTGAACATGT ATATGAACAA AATGTCGAAG
GTAAAACAAC AGATGCGAGA TGGGTTGCCT GAGCATAGGG AATTGCTCGA TAAAATCTGT
ACCTATGGTT TGCAATCTAT TTAA
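
A GC content figure like the one in the record can be recomputed from a sequence laid out in space-separated blocks, as above. The sketch below is a minimal illustration and not the database's own method. The helper name `gc_percent` is hypothetical, and the sample input is only the first row of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; pass the full sequence for the whole gene.
first_row = "ATGACAATCA GAAAAATAGC CATTATAGGG GGCGGGACTT CAGGTTGGTT AGCGGCGAAT"
print(round(gc_percent(first_row), 1))
```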
 
Protein sequence
MTIRKIAIIG GGTSGWLAAN HLGRVLQGRS ELSITLIESP DIPIIGVGEG TVPSIRKSLQ 
SFGISESEFI RSCDVTFKQS IKFVNWLDKA RHGKDNFYHH LFDIINLQGV DTVSAWLGEK
QGDFADYVSP QHLVCEAAKA PKLITTPEYA GVLGYAYHLN AAKFAKLLAK NAIEKFNVEH
IFATVQDVRL GDDGAISSLL TEQGTLSFDF YIDCSGFESI LLAKALKVPF ISKAHQLFID
TALVAQIPTQ PTDIIPPYTQ ATAHRAGWIW DIALTQRRGT GFVYSSTHMA QSEAERKFDH
YLGGKLADVP HRKIPMTVGH RQQFWVKNCV ALGLAQGFLE PIEATSILLT DFSARFLAER
FPVHTDDVDY LAKRFNDTVG YAWERVVEFA KLHYCLSDRT DSSFWLDNQA SETIPEGLKQ
RLELWKSYGP IAEDFPSKFE VFNLDNYLYV LYGMKYQTNL RGGSSTTQAS LNMYMNKMSK
VKQQMRDGLP EHRELLDKIC TYGLQSI
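
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce it in plain Python. It assumes the gene sequence is given in the coding orientation, and it uses the codon assignments of NCBI table 11, which match the standard code for amino acids. Alternative start codons are not treated specially here, which is sufficient for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(sequence: str) -> str:
    """Translate a coding sequence up to, and not including, the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence.
print(translate("ATGACAATCA GAAAAATAGC"))  # MTIRKI
```

The output matches the first six residues of the protein sequence listed above.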