Gene Shewmr4_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1402 
Symbol 
ID4251421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1632436 
End bp1634481 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content53% 
IMG OID638118001 
ProductTPR repeat-containing protein 
Protein accessionYP_733537 
Protein GI113969744 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000103328 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCTGC ATTTTATTCG TCCAGAATGG TTACTCGCCC TGTTGCCACT CGCCATCATC 
CTGTGGGCGC TATGGCGCCA ACACCAGAGT AACAGTGCTT GGAATCGCTA TATTGCGCCG
CACTTGGCCA AAATATTGGT CACAGAGGGC ACGCAAAAAT CCCGTCGACC ATTGCATATC
TTGGCTTTCA CTTGGGTGAT TGCCACCTTA GCCTTAGCGG GCCCCGCCCT CAATAAGCAA
ACCTTGCCGG TTTTTGCCGC CGAGCAAGGC CGAGTGTTGG TGATGGATAT GTCTGTCTCC
ATGTTTGCCA CCGATCTGGC GCCCAACCGC TTAACCCAAA CCAAGTTTCG GGCGACGGAT
TTGCTCCGCG GCTTAAAAGA AGGCGAAACC GGCCTGATTG CCTTCGCTGG CGATGCCTTT
ACCATAAGCC CGCTCACCCG AGACACGGGC ACGCTGCTTA ATTTATTACC CACGTTAAGC
CCTGACATTA TGCCCGTGTT GGGCTCAAAC CTAGCGGCCG CGCTAACCCA AGCTAAAAAT
CTACTGGCAC AGGGCGGGCA TCTACGGGGC GATATTATTG TGATGACAGA TGGCATCACC
CCGAGGCAAT TCGATGAGGC TAACTCTGCC TTAGCCGGTA GCCAATATCG CCTCGCGATC
ATGGGATTTG GTAGCCCCCA AGGCGCGCCC ATCCGCTTAC CCGATGGCCA ATTACAGCGT
GACAGCAGCA ATGAAGTCGT TGTCGCCAAA ACCGACTTTG GCTTACTGCA AAAATTAGCC
GATAACCATA ATGGCATCAT GATCCCCAAT CGTGCCGATG GTCAGGATTT AGCACAGTTG
CAACATTGGT TGAGTGACAG TGGCGATGCC AAAGCCACGG ATCTCGATGG AGAAACCTGG
CAAGATCTCG GCCCTTACCT CGCCTTACTC TTACTCATCC CTGCCCTGCT GAGCTTTAGG
CAAGGCATGC TCGCTAACTG GCTGCTGATG GGATTGGGCG GTCTATTGCT CAGCGCTGCG
CCGCAAAGCG CCCATGCCAG CGCATGGGAC TCACTCTGGC ACACCCAAGA GCAACAGGCG
ATGCAGGCCT ATCAAGCTGA GGATTACGCC AATGCCGCAC AAAAATTCGA GACGCCCCAA
TGGCAAGGCG CGGCGCAGTA CAAGGCGGGC GAGTACGAAC AGGCGCTAAA AAGCTTTGAG
CAAGATAGCT CTGCCAATGG CCTTTACAAC CAAGGCAACG CGCTGATGCA ACTCGGTAAA
CCCGACAAAG CCAAGGAGCG TTATCAGGCC GCACTCGACA AGCAGCCCGA GTTTCCACAG
GCCAAGGCCA ACCTTGAGTT AGCAGAGAAA TTGCTGAACC AGCAGCAGTC GCAGCAAAAT
GCTGACAATC AAGATAAACA GTCTCAAGGC GATCAAAACC AACAGGGACA GGACCAGAAC
GATCAGCAGC AGGGAGATCA ACAATCCTCG CAAAACGACC AAGCTCAAGA TCAGTCTCAA
GAGCAGCAGT CCCAACAGCA AAATAATTCT GATCAAGCGG ACAAAAAACC ATCGCAGGAG
CAAAGTACCT CATCAGAGCA GAATGATCCT GAGCAAGGCG CGCAGGATAA ACAGCAAGCG
AGTGATGAAA ACGCCAAGCA AGATCAGCAA GATGCACAGC AGGAACAACA GCAGGCCGAG
CAACAAGCCA ATCAGCAAAA CGGCGCCGAT AACAATGCTG AAGATAAAGA AGATCCAGCC
AGCAACGAGG CAAAAATGCA GGCAAAGGTC GAGGATGATA AGTCCAAAGC CAAGCAAGAA
CAGCAACAGG CCGTAGCGCA AAAAGCGGAT AAAGAGAAAC AGGCGCAGGC GGATAAAAAA
CCAGATACCG CTGTTGAGTC TGTTGAAGCG CCGCCGAGTA ACAGCGAGCC TTTACCCGCG
GAGATGCAGC GAGCACTGCG GGGCGTGAGT GAAGATCCAC AGGTGTTACT GCGTAACAAG
ATGCAACTCG AATATCAAAA ACGCCGTCAA AATGGCCAAA TATCAAGGGA TAACGAACAG
TGGTGA
 
Protein sequence
MSLHFIRPEW LLALLPLAII LWALWRQHQS NSAWNRYIAP HLAKILVTEG TQKSRRPLHI 
LAFTWVIATL ALAGPALNKQ TLPVFAAEQG RVLVMDMSVS MFATDLAPNR LTQTKFRATD
LLRGLKEGET GLIAFAGDAF TISPLTRDTG TLLNLLPTLS PDIMPVLGSN LAAALTQAKN
LLAQGGHLRG DIIVMTDGIT PRQFDEANSA LAGSQYRLAI MGFGSPQGAP IRLPDGQLQR
DSSNEVVVAK TDFGLLQKLA DNHNGIMIPN RADGQDLAQL QHWLSDSGDA KATDLDGETW
QDLGPYLALL LLIPALLSFR QGMLANWLLM GLGGLLLSAA PQSAHASAWD SLWHTQEQQA
MQAYQAEDYA NAAQKFETPQ WQGAAQYKAG EYEQALKSFE QDSSANGLYN QGNALMQLGK
PDKAKERYQA ALDKQPEFPQ AKANLELAEK LLNQQQSQQN ADNQDKQSQG DQNQQGQDQN
DQQQGDQQSS QNDQAQDQSQ EQQSQQQNNS DQADKKPSQE QSTSSEQNDP EQGAQDKQQA
SDENAKQDQQ DAQQEQQQAE QQANQQNGAD NNAEDKEDPA SNEAKMQAKV EDDKSKAKQE
QQQAVAQKAD KEKQAQADKK PDTAVESVEA PPSNSEPLPA EMQRALRGVS EDPQVLLRNK
MQLEYQKRRQ NGQISRDNEQ W