Gene Shewana3_3685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_3685 
Symbol 
ID4479886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp4423871 
End bp4425409 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content50% 
IMG OID639728290 
ProductSel1 domain-containing protein 
Protein accessionYP_871310 
Protein GI117922118 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTGA TACTTATCAT TATAGCCTTT GTCGTTCTGT TTATTCTTTT TGCCAAATAC 
CAAAAAAAGA AGATACAAGC CGAAGCGCTT GCCACGGGGG AACCCTCGGC GCTACTCAAT
CACGGACTCA CATTAATCAA TCAGGGCGAA GTCAACTCGG GCCTAGACTA TATTCAGCAG
GCCGTTGATA AAGGCTTCGC CGTAGCCGCC ATTGCCATGG CCGAGCTGTA TTCGGGTCGA
TTCCCACAAG TCCCCGCCGA TCCTAAGGCA TCTGAATATT GGTATCGAAA AGCCGCCGAA
ATCGATCCGC AATATTTACC CATGCTCACC CTGACTAGCT TGGTCGCCTC AGAGGCGCAA
ACACCAGAGC AGTTACGCGA GCAGGTAGAA CAACTCAAAG CCAGTGCTGA AGTGGGGCAA
GTCGAGTTTC AATATGAGCT CGCCTATTTG TATCTCAGAC AAGCCTTTCT CGATCCCGAT
GCAAGCCAAG CCATTTACTG GTTCGAAAAG GCGGCCGCCC AAGGCAACCA AGATGCCTAT
TACCACTTAG GTACGCTGTA TTGGCACGAT GAACGTGTCA CGCCAGACTA TAGCAAAGCG
CGGGAATACT TTGAGAAAGC CGTCGCCGCC GGTGATGAAC ACGCAAAAGA TAACTTAGGC
CATATGCTGG CCTCAGGCCA AGGCGGCCCT AAGGATCTCG TCCGCGCCGA AGCCTTACTC
AGCGAATATG CTGCAGAAAA TGATTTTCGC CAGTATTACC TAGGCAAACG TTTCCTCTAT
GGTGAAGATT TTGCCGTCGA CTACGGCAAA GCCCGCCACT GGTTAGAAAA ATCCTGCGCC
ACTGACAATG TCTTTGCCAA GTTAGCGCTG GCACACCTTA AGCTACTCGA CCCACAAACG
GATGATGACT ACCAGCAGGC AAGAACAGAA TTCGAGACGC TGGCGTCCCA GTGGCAAGAG
GAAGCCCTAT TCGGTCTAGG TAAAATCTAC GAGGAAGGCT TAGGCGTCTC GCGCCAACCG
ATTAAAGCCT TAATGTATTA CCAACTGGCG GCCATGAGCC ATATCAGTGA CTATCAAGAC
GCATACGAGA AGCTCAGTAA GCGCTTGGGC ACCTTAGAGA TCCGCGAAGC CCAGAGTCTG
TGTAATAACT TTCTGCATCA ACACCCTATT CCCGATGAGC AGCAAACTTA TTACTACCTT
AACCAAGCAG AAATCTACCG CAAGGGTGAG CATCCGAGCC GCGAAGCGCT GCAAACCGCA
GAGACTTGGT ACCGCAAAGC CGCCGACTTG GGCAGCCAAG ATGGCATGCA AGCCCTCGTT
GACATCAACC GCCATGAGTC GATTGATAAA CCGGTTCAGA CATACATTTG GTCTAGCATA
TTACTGCGCA ACTTTGGTCA ATACGGGATG AATAGCGATC AACTCTTGTA TCAACAACAG
GCCCTTTCCC GCTTAACGGA GTCCGAGCTG TTGTATGCCC AAGATGAAAT CGAGAGGATT
GAAACTCAGC TAGCGCCTTA CTTACAGAGC AACGAATAA
 
Protein sequence
MSLILIIIAF VVLFILFAKY QKKKIQAEAL ATGEPSALLN HGLTLINQGE VNSGLDYIQQ 
AVDKGFAVAA IAMAELYSGR FPQVPADPKA SEYWYRKAAE IDPQYLPMLT LTSLVASEAQ
TPEQLREQVE QLKASAEVGQ VEFQYELAYL YLRQAFLDPD ASQAIYWFEK AAAQGNQDAY
YHLGTLYWHD ERVTPDYSKA REYFEKAVAA GDEHAKDNLG HMLASGQGGP KDLVRAEALL
SEYAAENDFR QYYLGKRFLY GEDFAVDYGK ARHWLEKSCA TDNVFAKLAL AHLKLLDPQT
DDDYQQARTE FETLASQWQE EALFGLGKIY EEGLGVSRQP IKALMYYQLA AMSHISDYQD
AYEKLSKRLG TLEIREAQSL CNNFLHQHPI PDEQQTYYYL NQAEIYRKGE HPSREALQTA
ETWYRKAADL GSQDGMQALV DINRHESIDK PVQTYIWSSI LLRNFGQYGM NSDQLLYQQQ
ALSRLTESEL LYAQDEIERI ETQLAPYLQS NE